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UNIT — I INTRODUCTION TO DATA STRUCTURES, 
SEARCHING AND SORTING 


Basic Concepts: Introduction to Data Structures: 


A data structure is a way of storing data in a computer so that it can be used efficiently and it will allow 
the most efficient algorithm to be used. The choice of the data structure begins from the choice of an 
abstract data type (ADT). A well-designed data structure allows a variety of critical operations to be 
performed, using as few resources, both execution time and memory space, as possible. Data structure 
introduction refers to a scheme for organizing data, or in other words it is an arrangement of data in 
computer's memory in such a way that it could make the data quickly available to the processor for 
required calculations. 


A data structure should be seen as a logical concept that must address two fundamental concerns. 


1. First, how the data will be stored, and 
2. Second, what operations will be performed on it. 


As data structure is a scheme for data organization so the functional definition of a data structure should 
be independent of its implementation. The functional definition of a data structure is known as ADT 
(Abstract Data Type) which is independent of implementation. The way in which the data is organized 
affects the performance of a program for different tasks. Computer programmers decide which data 
structures to use based on the nature of the data and the processes that need to be performed on that 
data. Some of the more commonly used data structures include lists, arrays, stacks, queues, heaps, trees, 
and graphs. 


Classification of Data Structures: 
Data structures can be classified as 


e Simple data structure 
e Compound data structure 
e Linear data structure 


e Non linear data structure 


Data Structures 
Simple Data Structures Campound D ata Stricture 
Linear Data Structures Non-Linear Data Strucutres 


stack Tree 
HUELE Graph 
Lists 


[Fig 1.1 Classification of Data Structures] 


Simple Data Structure: 
Simple data structure can be constructed with the help of primitive data structure. A primitive data 
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structure used to represent the standard data types of any one of the computer languages. Variables, 
arrays, pointers, structures, unions, etc. are examples of primitive data structures. 

Compound Data structure: 

Compound data structure can be constructed with the help of any one of the primitive data structure and 
it is having a specific functionality. It can be designed by user. It can be classified as 


e Linear data structure 
e Non-linear data structure 


Linear Data Structure: 


Linear data structures can be constructed as a continuous arrangement of data elements in the memory. 
It can be constructed by using array data type. In the linear Data Structures the relationship of adjacency 
is maintained between the data elements. 


Operations applied on linear data structure: 
The following list of operations applied on linear data structures 


1. Add an element 

2. Delete an element 

3. Traverse 

4. Sort the list of elements 

5. Search for a data element 

For example Stack, Queue, Tables, List, and Linked Lists. 


Non-linear Data Structure: 


Non-linear data structure can be constructed as a collection of randomly distributed set of data item 
joined together by using a special pointer (tag). In non-linear Data structure the relationship of 
adjacency is not maintained between the data items. 


Operations applied on non-linear data structures: 

The following list of operations applied on non-linear data structures. 
1. Add elements 

2. Delete elements 

3. Display the elements 

4. Sort the list of elements 

5. Search for a data element 

For example Tree, Decision tree, Graph and Forest 


Abstract Data Type: 


An abstract data type, sometimes abbreviated ADT, is a logical description of how we view the data 
and the operations that are allowed without regard to how they will be implemented. This means that 
we are concerned only with what data is representing and not with how it will eventually be 
constructed. By providing this level of abstraction, we are creating an encapsulation around the data. 
The idea is that by encapsulating the details of the implementation, we are hiding them from the user’s 
view. This is called information hiding. The implementation of an abstract data type, often referred to 
as a data structure, will require that we provide a physical view of the data using some collection of 
programming constructs and primitive data types. 
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User 






Interface 





Operations 


[Fig. 1.2: Abstract Data Type (ADT)] 


Algorithms: 


Structure and Properties of Algorithm: 


An algorithm has the following structure 


l. 
2: 
3. 
4. 
5, 


Input Step 
Assignment Step 
Decision Step 
Repetitive Step 


Output Step 


An algorithm is endowed with the following properties: 


l. 


2, 


Finiteness: An algorithm must terminate after a finite number of steps. 
Definiteness: The steps of the algorithm must be precisely defined or unambiguously specified. 
Generality: An algorithm must be generic enough to solve all problems of a particular class. 


Effectiveness: the operations of the algorithm must be basic enough to be put down on pencil and 
paper. They should not be too complex to warrant writing another algorithm for the operation. 


Input-Output: The algorithm must have certain initial and precise inputs, and outputs that may be 
generated both at its intermediate and final steps. 


Different Approaches to Design an Algorithm: 


An algorithm does not enforce a language or mode for its expression but only demands adherence to its 
properties. 


Practical Algorithm Design Issues: 


l. 


To save time (Time Complexity): A program that runs faster is a better program. 


2. To save space (Space Complexity): A program that saves space over a competing program is 
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considerable desirable. 
Efficiency of Algorithms: 


The performances of algorithms can be measured on the scales of time and space. The performance of 
a program is the amount of computer memory and time needed to run a program. We use two 
approaches to determine the performance of a program. One is analytical and the other is experimental. 
In performance analysis we use analytical methods, while in performance measurement we conduct 
experiments. 


Time Complexity: The time complexity of an algorithm or a program is a function of the running time 
of the algorithm or a program. In other words, it is the amount of computer time it needs to run to 
completion. 


Space Complexity: The space complexity of an algorithm or program is a function of the space needed 
by the algorithm or program to run to completion. 


The time complexity of an algorithm can be computed either by an empirical or theoretical approach. 
The empirical or posteriori testing approach calls for implementing the complete algorithms and 
executing them on a computer for various instances of the problem. The time taken by the execution of 
the programs for various instances of the problem are noted and compared. The algorithm whose 
implementation yields the least time is considered as the best among the candidate algorithmic 
solutions. 


Analyzing Algorithms 


Suppose M is an algorithm, and suppose n 1s the size of the input data. Clearly the complexity f(n) of M 
increases as n increases. It is usually the rate of increase of f(n) with some standard functions. The most 
common computing times are 


O(1), O(log, n), O(n), O(n log, n), O(n’), O(n’), O(2") 


Example: 
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Program SegmentaA 


Program Segment B 


Program Segment C 


forj=ltondo 
for x = 1 to n do 


X=X +2; 





Total Frequency Count of Program Segment À 


Program Statements 


Total Frequency Count 
Total Frequency Count of Program Segment È 


Program Statements 


Total Frequency Count 


Total Frequency Count ofProgram Segment C 


Program Statements 


forj=ltondo 
forx=ltondo 


X =X +2: 


Total Frequency Count 





Frequency Count 


Frequency Count 


(n+ 1) 


3n+1 


Frequency Count 


(n+1) 


n(n+1) 


3n*+3n+1 


The total frequency counts of the program segments A, B and C given by 1, (3n+1) and (3n*+3n+1) 
respectively are expressed as O(1), O(n) and O(n’). These are referred to as the time complexities of the 
program segments since they are indicative of the running times of the program segments. In a similar 
manner space complexities of a program can also be expressed in terms of mathematical notations, 


Download FREE Computer Science Notes at TutorialsDuniya.com 


Download FREE Computer Science Notes at TutorialsDuniya.com 


which is nothing but the amount of memory they require for their execution. 
Asymptotic Notations: 


It is often used to describe how the size of the input data affects an algorithm’s usage of computational 
resources. Running time of an algorithm is described as a function of input size n for large n. 


Big oh(Q): Definition: f(n) = O(g(n)) (read as f of n is big oh of g of n) if there exist a positive integer 
no and a positive number c such that |f(n)| < clg(n)| for all n > no. Here g(n) is the upper bound of the 
function f(n). 


cein) 
f(n) g(n) 


sa f(n} = O(n ) 





34n- 40 n fin) = O(n) 


50 1 f(n) = O(1) 





fi 
no J Cin) = OCe(1)) 


Omega(Q): Definition: f(n) = Q(g(n)) ( read as f of n is omega of g of n), if there exists a positive 
integer no and a positive number c such that |f(n)| > c |g(n)| for all n > no, Here g(n) is the lower bound 
of the function f(n). 






16n +8n +2 f(n) = Q(n ) 


24n+9 n f(n) = Q(n) 





ee 





mg f(n) = S2Ce2@()) 


Theta(®): Definition: f(n) = O(g(n)) (read as f of n is theta of g of n), if there exists a positive integer 
no and two positive constants cı and cz such that cı |g(n)| < |f(n)| < cz |g(n)| for all n > no, The function 
g(n) is both an upper bound and a lower bound for the function f(n) for all values of n, n> no. 
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f(n) 


g(n) 
16n* + 30n*- : 
n“ 


f(n) = O(n* ) 


fin) = © (2°) 








r 
n oo 
2 Tn) = OCH) 


Little oh(o): Definition: f(n) = O(g(n)) (read as f of n is little oh of g of n), if f) = O(g(n)) and f(n) 4 
(g(n)). 
Time Complexity: 


Time Complexities of various Algorithms: 


Numerical Comparision of Different Algorithms: 





Reasons for analyzing algorithms: 


1. To predict the resources that the algorithm requires 
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e Computational Time(CPU consumption). 
e Memory Space(RAM consumption). 
e Communication bandwidth consumption. 
2. To predict the running time of an algorithm 
e Total number of primitive operations executed. 
Recursive Algorithms: 


GCD Design: Given two integers a and b, the greatest common divisor is recursively found using the 


formula 
gcd(a,b) = Ja if b=0 


b if a=0 
d(b, a mod b) } 


Fibonacci Design: To start a fibonacci series, we need to know the first two numbers. 


Fibonacci(n) = (0 if n=0 \ 
ase case 


1 if n=] 
ibonacci(n-1) + fibonacci(n-2) } 
Difference between Recursion and Iteration: 


1. A function is said to be recursive if it calls itself again and again within its body whereas iterative 
functions are loop based imperative functions. 


2. Reursion uses stack whereas iteration does not use stack. 

3. Recursion uses more memory than iteration as its concept is based on stacks. 

4. Recursion is comparatively slower than iteration due to overhead condition of maintaining stacks. 
5. Recursion makes code smaller and iteration makes code longer. 


6. Iteration terminates when the loop-continuation condition fails whereas recursion terminates when 
a base case is recognized. 


7. While using recursion multiple activation records are created on stack for each call where as in 
iteration everything is done in one activation record. 


8. Infinite recursion can crash the system whereas infinite looping uses CPU cycles repeatedly. 
9. Recursion uses selection structure whereas iteration uses repetetion structure. 
Types of Recursion: 


Recursion 1s of two types depending on whether a function calls itself from within itself or whether two 
functions call one another mutually. The former is called direct recursion and the later is called 
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indirect recursion. Thus there are two types of recursion: 
e Direct Recursion 
e Indirect Recursion 
Recursion may be further categorized as: 
e Linear Recursion 
e Binary Recursion 
e Multiple Recursion 
Linear Recursion: 


It is the most common type of Recursion in which function calls itself repeatedly until base condition 
[termination case] is reached. Once the base case is reached the results are return to the caller function. 
If a recursive function is called only once then it is called a linear recursion. 


J 
Vi 


Binary Recursion: 


Some recursive functions don't just have one call to themselves; they have two (or more). Functions 
with two recursive calls are referred to as binary recursive functions. 


Examplel: The Fibonacci function fib provides a classic example of binary recursion. The Fibonacci 
numbers can be defined by the rule: 


fib(n) = O if nis O, 
= 1 ifnisl, 
= fib(n-1) + fib(n-2) otherwise 
For example, the first seven Fibonacci numbers are 
Fib(O) = 0 
Fib(1) = 1 
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Fib(2) = Fib(1) + Fib(0) = 1 
Fib(3) = Fib(2) + Fib(1) = 2 
Fib(4) = Fib(3) + Fib(2) = 3 
Fib(5) = Fib(4) + Fib(3) = 5 
Fib(6) = Fib(5) + Fib(4) = 8 


FG) 3 
Pie ee 


F¢2) Fol) Fel) FCO) F(1) FCO) 


ae 


Fil) Fro) 
# Program to display the Fibonacci sequence up to n-th term where n is provided by the user 
# change this value for a different result 
nterms = 10 
# uncomment to take input from the user 
#nterms = int(input("How many terms? ")) 
# first two terms 
nl =0 
n2=1 
count = 0 
# check if the number of terms is valid 
if nterms <= 0: 
print("'Please enter a positive integer") 
elif nterms == 1: 
print("Fibonacci sequence upto" ,nterms,":") 
print(n1) 


else: 
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mon 


print(""Fibonacci sequence upto" ,nterms,":") 
while count < nterms: 

print(nl,end=", ') 

nth = n1 + n2 


# update values 


nl=n2 
n2 = nth 
count += 1 


Tail Recursion: 


Tail recursion is a form of linear recursion. In tail recursion, the recursive call is the last thing the 
function does. Often, the value of the recursive call is returned. As such, tail recursive functions can 
often be easily implemented in an iterative manner; by taking out the recursive call and replacing it with 
a loop, the same effect can generally be achieved. In fact, a good compiler can recognize tail recursion 
and convert it to iteration in order to optimize the performance of the code. 


A good example of a tail recursive function is a function to compute the GCD, or Greatest Common 
Denominator, of two numbers: 


def factorial(n): 
if n == Q: return 1 


else: return factorial(n-1) * n 


def tail_factorial(n, accumulator=1): 
if n == 0: return | 


else: return tail_factorial(n-1, accumulator * n) 

Recursive algorithms for Factorial, GCD, Fibonacci Series and Towers of Hanoi: 
Factorial(n) 

Input: integer n > 0 

Output: n! 

1. If n = O then return (1) 

2, else return prod(n, factorial(n — 1)) 

GCD(m, n) 

Input: integers m > 0, n> 0 

Output: gcd (m, n) 
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l. If n = O then return (m) 

2. else return gcd(n,m mod n) 

Time-Complexity: O(n n) 

Fibonacci(n) 

Input: integer n > 0 

Output: Fibonacci Series: 1 1 2 3 5 8 13.......... cece cece cece ee eeeee 
l. if n=1 or n=2 

2 then Fibonacci(n)=1 

3. else Fibonacci(n) = Fibonacci(n-1) + Fibonacci(n-2) 

Towers of Hanoi 


Input: The aim of the tower of Hanoi problem is to move the initial n different sized disks from needle 
A to needle C using a temporary needle B. The rule is that no larger disk is to be placed above the 
smaller disk in any of the needle while moving or at any time, and only the top of the disk is to be 
moved at a time from any needle to any needle. 


Output: 

L. If n=1, move the single disk from A to C and return, 

2: If n>1, move the top n-1 disks from A to B using C as temporary. 
i Move the remaining disk from A to C. 

4. Move the n-1 disk disks from B to C, using A as temporary. 





def TowerOfHanoi(n , from_rod, to_rod, aux_rod): 
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ifn == 1: 
print "Move disk 1 from rod" ,from_rod,"to rod",to_rod 
return 
TowerOfHanoi(n-1, from_rod, aux_rod, to_rod) 
print "Move disk",n,"from rod" ,from_rod,"to rod" ,to_rod 
TowerOfHanoi(n-1, aux_rod, to_rod, from_rod) 
n=4 
TowerOfHanoi(n, 'A', 'C’, 'B’) 
Searching Techniques: 


Linear Search: Searching is a process of finding a particular data item from a collection of data items 
based on specific criteria. Every day we perform web searches to locate data items containing in various 
pages. A search typically performed using a search key and it answers either True or False based on the 
item is present or not in the list. Linear search algorithm is the most simplest algorithm to do sequential 
search and this technique iterates over the sequence and checks one item at a time, until the desired item 
is found or all items have been examined. In Python the in operator is used to find the desired item in a 
sequence of items. The in operator makes searching task simpler and hides the inner working details. 


if key in theArray : 

print( "The key is in the array." ) 
else : 

print( "The key is not in the array." ) 


>>> 15 in [3,5,2,4,1] 
False 

>>> 3 in [3,5,2,4,1] 
True 

a 


Consider an unsorted single dimensional array of integers and we need to check whether 31 is present in 
the array or not, then search begins with the first element. As the first element doesn't contain the desired 
value, then the next element is compared to value 31 and this process continues until the desired element 
is found in the sixth position. Similarly, if we want to search for 8 in the same array, then the search 
begins in the same manner, starting with the first element until the desired element is found. In linear 
search, we cannot determine that a given search value is present in the sequence or not until the entire 
array is traversed. 
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Source Code: 


def linear_search(obj, item, start=0): 
for 1 in range(start, len(obj)): 
if obj[i] == item: 
return 1 
return -1 
arr=[1,2,3,4,5,6,7,8] 
x=4 
result=linear_search(arr,x) 
if result==-1: 
print ("element does not exist") 
else: 
print ("element exist in position %d" %result) 
Time Complexity of Linear Search: 


Any algorithm is analyzed based on the unit of computation it performs. For linear search, we need to 
count the number of comparisons performed, but each comparison may or may not search the desired 


If item is present 


item. 


If item is not present 





Binary Search: In Binary search algorithm, the target key is examined in a sorted sequence and this 
algorithm starts searching with the middle item of the sorted sequence. 


a. If the middle item is the target value, then the search item 1s found and it returns True. 
b. If the target item < middle item, then search for the target value in the first half of the list. 
c. If the target item > middle item, then search for the target value in the second half of the list. 


In binary search as the list is ordered, so we can eliminate half of the values in the list in each iteration. 
Consider an example, suppose we want to search 10 in a sorted array of elements, then we first determine 
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the middle element of the array. As the middle item contains 18, which is greater than the target value 10, 
so can discard the second half of the list and repeat the process to first half of the array. This process is 
repeated until the desired target item is located in the list. If the item is found then it returns True, 
otherwise False. 





aoo00) 0000 





Searching for 10 in a sorted array using Binary Search 


Source Code: 


array =[1,2,3,4,5,6,7,8,9] 


def binary_search(searchfor, array): 
lowerbound=0 
upperbound=len(array)- 1 
found=False 
while found==False and lowerbound<=upperbound: 
midpoint=(lowerbound+upperbound)//2 
if array[midpoint]==searchfor: 
found =True 
return found 
elif array[midpoint]<searchfor: 
lowerbound=midpoint+ 1 
else: 
upperbound=midpoint-1 
return found 


searchfor=int(input("what are you searching for?")) 
if binary_search(searchfor, array): 
print ("element found") 
else: 
print ("element not found") 
Time Complexity of Binary Search: 


In Binary Search, each comparison eliminates about half of the items from the list. Consider a list with n 
items, then about n/2 items will be eliminated after first comparison. After second comparison, n/4 items 
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of the list will be eliminated. If this process is repeated for several times, then there will be just one item 
left in the list. The number of comparisons required to reach to this point is n/2' = 1. If we solve for i, then 
it gives us i = log n. The maximum number is comparison is logarithmic in nature, hence the time 
complexity of binary search is O(log n). 


Fibonacci Search: It is a comparison based technique that uses Fibonacci numbers to search an element 
in a sorted array. It follows divide and conquer approach and it has a O(log n) time complexity. Let the 
element to be searched is x, then the idea is to first find the smallest Fibonacci number that is greater than 
or equal to length of given array. Let the Fibonacci number be fib(n" Fibonacci number). Use (n-2)th 
Fibonacci number as index and say it is 1, then compare a[i] with x, if x is same then return 1. Else if x is 
greater, then search the sub array after 1, else search the sub array before 1. 





Source Code: 


# Python3 program for Fibonacci search. 
from bisect import bisect_left 


# Returns index of x if present, else 
# returns -1 
def fibMonaccianSearch(arr, x, n): 


# Initialize fibonacci numbers 

fibMMm2 = 0 # (m-2)'th Fibonacci No. 
fibMMm1 = 1 # (m-1)'th Fibonacci No. 

fibM = fibMMm2 + fibMMm1 # m'th Fibonacci 


# fibM is going to store the smallest 
# Fibonacci Number greater than or equal to n 
while (fibM < n): 

fibMMm2 = fibMMm1 

fibMMm1 = fibM 

fibM = fibMMm2 + fibMMm1 


# Marks the eliminated range from front 
offset = -1; 


# while there are elements to be inspected. 
# Note that we compare arr[fibMm2] with x. 


# When fibM becomes 1, fibMm2 becomes 0 
while (fibM > 1): 
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# Check if fibMm2 1s a valid location 
i = min(offset+fibMMmz?2, n-1) 


# If x is greater than the value at 
# index fibMm2, cut the subarray array 
# from offset to 1 
if (arr[i] < x): 
fibM = fibMMm1 
fibMMm1 = filbMMm2 
fibMMm2 = fibM - fibMMm1 
offset =1 


# If x is greater than the value at 

# index fibMm2, cut the subarray 

# after 1+1 

elif (arr[i] > x): 
fibM = fibMMm2 
fibMMm1 = fibMMm1 - fibMMm2 
fibMMm2 = fibM - fibMMm1 


# element found. return index 
else: 
return 1 


# comparing the last element with x */ 
if(fibMMm1 and arr[offset+1] == x): 
return offset+1; 


# element not found. return -1 
return -1 


# Driver Code 

arr = [10, 22, 35, 40, 45, 50, 80, 82, 85, 90, 100] 

n = len(arr) 

x = 80 

print(" Found at index:", 
fibMonaccianSearch(arr, x, n)) 

Time Complexity of Fibonacci Search: 


Time complexity for Fibonacci search is O(log) n) 
Sorting Techniques: 


Sorting in general refers to various methods of arranging or ordering things based on criteria's 
(numerical, chronological, alphabetical, hierarchical etc.). There are many approaches to sorting data and 
each has its own merits and demerits. 
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Bubble Sort: 


This sorting technique is also known as exchange sort, which arranges values by iterating over the list 
several times and in each iteration the larger value gets bubble up to the end of the list. This algorithm 
uses multiple passes and in each pass the first and second data items are compared. if the first data item is 
bigger than the second, then the two items are swapped. Next the items in second and third position are 
compared and if the first one is larger than the second, then they are swapped, otherwise no change in 
their order. This process continues for each successive pair of data items until all items are sorted. 


Bubble Sort Algorithm: 
Step 1: Repeat Steps 2 and 3 for i=1 to 10 
Step 2: Set j=1 
Step 3: Repeat while j<=n 
(A) if afi] < a[l] 
Then interchange ali] and a[j] 
[End of if] 
(B) Set j =j+l 
[End of Inner Loop] 
[End of Step 1 Outer Loop] 


Step 4: Exit 
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Various Passes of Bubble Sort 


Source Code: 
# Python program for implementation of Bubble Sort 


def bubbleSort(arr): 
n = len(arr) 


# Traverse through all array elements 
for 1 in range(n): 


# Last 1 elements are already in place 
for j in range(0, n-1-1): 


# traverse the array from 0 to n-1-1 

# Swap if the element found is greater 
# than the next element 

if arr[j] > arr[j+1] : 
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arr[j], arr[j+1] = arr[j+1], arr[j] 


# Driver code to test above 
arr = [64, 34, 25, 12, 22, 11, 90] 


bubbleSort(arr) 


print ("Sorted array 1s:") 
for 1 in range(len(arr)): 
print ("%d" %arr[i]) 


Step-by-step example: 


Let us take the array of numbers "5 1 4 2 8", and sort the array from lowest number to greatest number 
using bubble sort. In each step, elements written in bold are being compared. Three passes will be 
required. 


First Pass: 

(51428) (15428), Here, algorithm compares the first two elements, and swaps since 5 > 1. 
(15428) (14528), Swap since 5> 4 

(14528) (14258), Swap since 5 > 2 

(14258) 


(14258), Now, since these elements are already in order (8 > 5), algorithm does not swap them. 


Second Pass: 


(14258) (14258) 
(14258) (12458), Swap since 4 > 2 
(12458) (12458) 
(12458) (124358) 


Now, the array is already sorted, but our algorithm does not know if it is completed. The algorithm needs 
one whole pass without any swap to know it is sorted. 


Third Pass: 

(12458) (12458) 
(12458) (12458) 
(12458) (12458) 


20 


Download FREE Computer Science Notes at TutorialsDuniya.com 


Download FREE Computer Science Notes at TutorialsDuniya.com 


(12458) (12458) 
Time Complexity: 


The efficiency of Bubble sort algorithm is independent of number of data items in the array and its initial 
arrangement. If an array containing n data items, then the outer loop executes n-1 times as the algorithm 
requires n-1 passes. In the first pass, the inner loop is executed n-1 times; in the second pass, n-2 times; in 
the third pass, n-3 times and so on. The total number of iterations resulting in a run time of O( n’). 


Worst Case Performance O(n’) 
Best Case Performance O(n’) 
Average Case Performance O(n’) 


Selection Sort: 

Selection sort algorithm is one of the simplest sorting algorithm, which sorts the elements in an array by 
finding the minimum element in each pass from unsorted part and keeps it in the beginning. This 
sorting technique improves over bubble sort by making only one exchange in each pass. This sorting 
technique maintains two sub arrays, one sub array which is already sorted and the other one which is 
unsorted. In each iteration the minimum element (ascending order) is picked from unsorted array and 
moved to sorted sub array.. 


Selection Sort Algorithm: 


Fori=1ton-1 /*where nis the size of the array */ 
set: min_index=1_ /*here we took starting index as 1 */ 
For j=i+1 ton 
If arr[j]<arr[min_index] 
set: min_index=j 
Endlf 
EndFor 
swap (arr[i] and swap[min_index]) 
EndFor 
). Exit. 


Source Code: 


# Python program for implementation of Selection 
# Sort 

import sys 

A = [64, 25, 12, 22, 11] 


# Traverse through all array elements 
for 1 in range(len(A)): 
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# Find the minimum element in remaining 
# unsorted array 
min_idx =1 
for j in range(i+1, len(A)): 
if A[min_idx] > Afj]: 
min_idx =J 


# Swap the found minimum element with 
# the first element 
Af{ij, A[min_idx] = A[min_idx], A[1] 


# Driver code to test above 

print ("Sorted array") 

for 1 in range(len(A)): 
print("%d" %A[i]) 

Output: 


Enter array size:6 

Enter the elements:96 94 81 56 76 45 

The elements after sorting are: 45 56 76 81 94 96 
Step-by-step example: 

Here is an example of this sort algorithm sorting five elements: 
64 25 122211 

11 25 12 22 64 

11 12 25 22 64 

11 12 22 25 64 

11 12 22 25 64 

Time Complexity: 


Selection sort is not difficult to analyze compared to other sorting algorithms since none of the loops 
depend on the data in the array. Selecting the lowest element requires scanning all n elements (this takes 
n— | comparisons) and then swapping it into the first position. Finding the next lowest element requires 
scanning the remaining n — 1 elements and so on, for (n- 1)+(n-2)+...4+24+1=n(n—1)/2€ O(n’) 
comparisons. Each of these scans requires one swap for n — 1 elements (the final element is already in 
place). 


Worst Case Performance O(n’) 


Best Case Performance O(n’) 
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Average Case Performance O(n’) 
Insertion Sort: 


An algorithm consider the elements one at a time, inserting each in its suitable place among those already 
considered (keeping them sorted). Insertion sort is an example of an incremental algorithm. It builds the 
sorted sequence one number at a time. This is a suitable sorting technique in playing card games. 
Insertion sort provides several advantages: 


e Simple implementation 
e Efficient for (quite) small data sets 


e Adaptive (1.e., efficient) for data sets that are already substantially sorted: the time complexity is 
O(n + d), where d is the number of inversions 


e More efficient in practice than most other simple quadratic (1.e., O(n’)) algorithms such 
as selection sort or bubble sort; the best case (nearly sorted input) is O(n) 


e Stable; 1.e., does not change the relative order of elements with equal keys 
e In-place; i.e., only requires a constant amount O(1) of additional memory space 


e Online; 1.e., can sort a list as it receives it 


7 03) oe) 00) on 
oOo c a e o 
E A e a a a o 
M7 6 oe) Sooo 
1703) Ge Goo) 
a | Bimm 
GANE a o 
S C o a 
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Source Code: 


# Python program for implementation of Insertion Sort 


# Function to do insertion sort 
def insertionSort(arr): 


# Traverse through 1 to len(arr) 
for 1 in range(1, len(arr)): 
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key = arr[i] 


# Move elements of arr[0..i1-1], that are 
# greater than key, to one position ahead 
# of their current position 
j=rl 
while j >=0 and key < arr[j] : 

arr[j+1] = arr[j] 

j-=1 
arr[j+1] = key 


# Driver code to test above 

arr = [ 12, 11, 13, 5, 6] 

insertionSort(arr) 

print ("Sorted array 1s:") 

for 1 in range(len(arr)): 
print ("%d" %arr[1]) 


Step-by-step example: 








a 
Checking second element of 

' array with element before 

| bo it and inserting it in proper 

position. In this case, 3 is 

inserted in position of 12. 





Checking third element of 
array with elements before 
it and inserting it in proper 
position. In this case, 1 is 
inserted in position of 3. 


Step 2 3 12 1 5 8 


[ | ) Checking fourth element of 
. 2 $ array with elements before 
Step 3 l 3 12 5 8 it and inserting it in proper 


position. In this case, 5 is 
inserted in position of 12. 


Checking fifth element of 
array with elements before 
it and inserting it in proper 
position. In this case, 8 is 
inserted in position of 12. 





Sorted Array in Ascending Order 








Figure: Sorting Array in Ascending Order Using Insertion Sort Algorithm 
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Suppose, you want to sort elements in ascending as in above figure. Then, 


1. The second element of an array is compared with the elements that appear before it (only first 
element in this case). If the second element is smaller than first element, second element is 
inserted in the position of first element. After first step, first two elements of an array will be 
sorted. 


2. The third element of an array is compared with the elements that appears before it (first and 
second element). If third element is smaller than first element, it is inserted in the position of first 
element. If third element is larger than first element but, smaller than second element, it is 
inserted in the position of second element. If third element is larger than both the elements, it is 
kept in the position as it is. After second step, first three elements of an array will be sorted. 


3. Similarly, the fourth element of an array is compared with the elements that appear before it (first, 
second and third element) and the same procedure is applied and that element is inserted in the 
proper position. After third step, first four elements of an array will be sorted. 


If there are n elements to be sorted. Then, this procedure is repeated n-/ times to get sorted list of array. 
Time Complexity: 
Worst Case Performance O(n’) 


Best Case Performance(nearly) O(n) 





Average Case Performance 


Output: 

Enter no of elements:5 

Enter elements:1 65 0 32 66 
Elements after sorting: 0 1 32 65 66 
Quick Sort: 


Quick sort is a divide and conquer algorithm. Quick sort first divides a large list into two smaller sub- 
lists: the low elements and the high elements. Quick sort can then recursively sort the sub-lists. 


The steps are: 
1. Pick an element, called a pivot, from the list. 


2. Reorder the list so that all elements with values less than the pivot come before the pivot, while 
all elements with values greater than the pivot come after it (equal values can go either way). 
After this partitioning, the pivot is in its final position. This is called the partition operation. 


3. Recursively apply the above steps to the sub-list of elements with smaller values and separately 
the sub-list of elements with greater values. 


The base case of the recursion 1s lists of size zero or one, which never need to be sorted. 
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Quick sort, or partition-exchange sort, is a sorting algorithm developed by Tony Hoare that, on 
average, makes O(n log n) comparisons to sort n items. In the worst case, it makes O(n’) comparisons, 
though this behavior is rare. Quick sort is often faster in practice than other O(n log n) algorithms. It 
works by first of all by partitioning the array around a pivot value and then dealing with the 2 smaller 
partitions separately. Partitioning is the most complex part of quick sort. The simplest thing is to use the 
first value in the array, a[l] (or a[0] as | = O to begin with) as the pivot. After the partitioning, all values to 
the left of the pivot are <= pivot and all values to the right are > pivot. The same procedure for the two 
remaining sub lists is repeated and so on recursively until we have the entire list sorted. 


Advantages: 
e One of the fastest algorithms on average. 


e Does not need additional memory (the sorting takes place in the array - this is called in-place 
processing). 


Disadvantages: The worst-case complexity is O(N’) 
Source Code: 
# Python program for implementation of Quicksort Sort 
# This function takes last element as pivot, places 
# the pivot element at its correct position in sorted 


# array, and places all smaller (smaller than pivot) 
# to left of pivot and all greater elements to right 


# of pivot 
def partition(arr,low, high): 
1 = ( low-1 ) # index of smaller element 


pivot = arr[high] # pivot 
for j in range(low , high): 


# If current element is smaller than or 
# equal to pivot 
if arr[j] <= pivot: 


# increment index of smaller element 
1=1+1 
arr[i],arr[j] = arr[j],arr[1] 


arr[i+1],arr[high] = arr[high],arr[i+1] 
return (i+1 ) 


# The main function that implements QuickSort 
# arr[] --> Array to be sorted, 

# low --> Starting index, 

# high --> Ending index 
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# Function to do Quick sort 
def quickSort(arr,low, high): 
if low < high: 


# pi is partitioning index, arr[p] is now 
# at right place 
pi = partition(arr,low, high) 


# Separately sort elements before 
# partition and after partition 
quickSort(arr, low, pi-1) 
quickSort(arr, pit+1, high) 


# Driver code to test above 
arr = [10, 7, 8, 9, 1, 5] 
n = len(arr) 
quickSort(arr,0,n-1) 
print ("Sorted array 1s:") 
for 1 in range(n): 

print ("%d" %arr[1]) 


Step-by-step example: 
1 2 3 4 5 6 7 8 9 10 11 12 13 Remarks 
38 08 16 06 79 57 24 56 02 58 04 70 45 


Pivot 08 16 06 Up 57 24 56 02 58 Dn 70 45 Swap up and 
down 


Pivot 08 16 06 04 57 24 56 02 58 79 70 45 


Pivot 08 16 06 04 Up 24 56 Dn 58 79 70 45 Swap up and 
down 


Pivot 08 16 06 04 02 24 56 57 58 79 70 45 


Pivot 08 16 06 04 02 Dn Up 57 58 79 70 45 Swap pivot 
and down 


24 08 16 06 04 02 38 56 57 58 79 70 45 


Pivot 08 16 06 04 Dn Up 56 57 58 79 70 45 Swap pivot 
and down 


(02 08 16 06 04 24 38 (56 57 58 79 70 45) 


Pivot 08 16 06 04 Up 
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dn 


Pivot Up 06 Dn Swap up and 
down 


Pivot 04 06 16 


Pivot 04 Dn Up Swap pivot 
and down 

06 04 08 

Pivot Dn Up Swap pivot 
and down 

04 06 


(02 04 06 08 16 24 38) (56 57 58 79 70 45) 


Pivot Up 58 79 70 Dn Swap up and 
down 


Pivot 45 58 i \ () 57 


Pivot Dn Up 79 70 57 Swap pivot 
and down 


(45) 56 (58 79 70 57) 


Pivot Up 70 Dn Swap up and 
down 


Pivot 57 70 79 


Pivot Dn Up 79 Swap down 
and pivot 


(57) 58 (70 79) 


Pivot Up Swap pivot 


and down 
Dn 
02 04 06 08 16 24 38 45 56 57 58 70 79 The array is 
sorted 
Time Complexity: 

Worst Case Performance O(n’) 

Best Case Performance(nearly) O(n log, n) 

Average Case Performance O(n log: n) 
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Merge Sort: 


Merge sort is based on Divide and conquer method. It takes the list to be sorted and divide it in half to 
create two unsorted lists. The two unsorted lists are then sorted and merged to get a sorted list. The two 
unsorted lists are sorted by continually calling the merge-sort algorithm; we eventually get a list of size 
1 which is already sorted. The two lists of size 1 are then merged. 


Merge Sort Procedure: 
This is a divide and conquer algorithm. 
This works as follows : 
1. Divide the input which we have to sort into two parts in the middle. Call it the left part and right part. 


2. Sort each of them separately. Note that here sort does not mean to sort it using some other method. 
We use the same function recursively. 


3. Then merge the two sorted parts. 


Input the total number of elements that are there in an array (number_of_elements). Input the array 
(array[number_of_elements]). Then call the function MergeSort() to sort the input array. MergeSort() 
function sorts the array in the range [left,right] i.e. from index left to index right inclusive. Merge() 
function merges the two sorted parts. Sorted parts will be from [left, mid] and [mid+1, right]. After 
merging output the sorted array. 


MergeSort() function: 


It takes the array, left-most and right-most index of the array to be sorted as arguments. Middle index 
(mid) of the array is calculated as (left + rght)/2. Check if (left<right) cause we have to sort only when 
left<right because when left=right it is anyhow sorted. Sort the left part by calling MergeSort() function 
again over the left part MergeSort(array,left,mid) and the right part by recursive call of MergeSort 
function as MergeSort(array,mid + 1, right). Lastly merge the two arrays using the Merge function. 


Merge() function: 
It takes the array, left-most , middle and right-most index of the array to be merged as arguments. 
Finally copy back the sorted array to the original array. 
Source Code: 
# Recursive Python Program for merge sort 
def merge(left, right): 


if not len(left) or not len(right): 
return left or right 


result = [] 
1, ) = 0, O 
while (len(result) < len(left) + len(right)): 
if left[i] < right[j]: 
result.append(left[1]) 
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i+= | 

else: 
result.append(right[] |) 
jt= 1 

if i == len(left) or j == len(right): 
result.extend(left[1:] or right[j:]) 
break 


return result 


def mergesort(list): 
if len(list) < 2: 
return list 


middle = int(len(list)/2) 
left = mergesort(list[:middle]) 
right = mergesort(list[middle: ]) 


return merge(left, right) 
seq = [12, 11, 13, 5, 6, 7] 
print("Given array 1s") 
print(seq); 
print(\n") 
print("Sorted array is") 
print(mergesort(seq)) 


Step-by-step example: 
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OC 





[3 [e |20 |27 |38 | 43 |82 


Merge Sort Example 
Time Complexity: 
Worst Case Performance O(n log, n) 
Best Case Performance(nearly) O(n log, n) 
Average Case Performance O(n log, n) 
Comparison of Sorting Algorithms: 
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Time Complexity comparison of Sorting Algorithms: 


Algorithm ||Data Structure Time Complexity 


Space Complexity comparison of Sorting Algorithms: 


Algorithm || Data Structure Worst Case Auxiliary Space 
Complexity 
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UNIT —- H LINEAR DATA STRUCTURES 


Stacks Primitive Operations: 


A stack is a container of objects that are inserted and removed according to the last-in first-out (LIFO) 
principle. In the pushdown stacks only two operations are allowed: push the item into the stack, and pop 
the item out of the stack. A stack 1s a limited access data structure - elements can be added and removed 
from the stack only at the top. Push adds an item to the top of the stack, pop removes the item from the 
top. A helpful analogy is to think of a stack of books; you can remove only the top book, also you can 
add a new book on the top. 


A stack may be implemented to have a bounded capacity. If the stack is full and does not contain enough 
space to accept an entity to be pushed, the stack is then considered to be in an overflow state. The pop 
Operation removes an item from the top of the stack. A pop either reveals previously concealed items or 
results in an empty stack, but, if the stack is empty, it goes into underflow state, which means no items 
are present in stack to be removed. 


_ - iz 


Push Yy / Pop 





Stack (ADT) Data Structure: 


Stack is an Abstract data structure (ADT) works on the principle Last In First Out (LIFO). The last 
element add to the stack is the first element to be delete. Insertion and deletion can be takes place at one 
end called TOP. It looks like one side closed tube. 

e The add operation of the stack is called push operation 

e The delete operation is called as pop operation. 

e Push operation on a full stack causes stack overflow. 

e Pop operation on an empty stack causes stack underflow. 

e SP is a pointer, which is used to access the top element of the stack. 

e If you push elements that are added at the top of the stack; 


e Inthe same way when we pop the elements, the element at the top of the stack is deleted. 
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Fig ül Fig 02 Fig 03 Fig 04 


Stack with 4 locations Stack with 4 locations Stack with 4 locations Stack with 4 locations 
SP = -] oP = 2 SF =O SP= 35 


Operations of stack: 

There are two operations applied on stack they are 

1. push 

2. pop. 

While performing push & pop operations the following test must be conducted on the stack. 

1) Stack is empty or not 

2) Stack is full or not 

Push: 

Push operation is used to add new elements in to the stack. At the time of addition first check the stack is 
full or not. If the stack is full it generates an error message "stack overflow". 

Pop: 

Pop operation is used to delete elements from the stack. At the time of deletion first check the stack is 
empty or not. If the stack is empty it generates an error message "stack underflow". 


Representation of a Stack using Arrays: 


Let us consider a stack with 6 elements capacity. This is called as the size of the stack. The number of 
elements to be added should not exceed the maximum size of the stack. If we attempt to add new 
element beyond the maximum size, we will encounter a stack overflow condition. Similarly, you cannot 
remove elements beyond the base of the stack. If such is the case, we will reach a stack underflow 
condition. 


When a element is added to a stack, the operation is performed by push(). 





q 4 4 
2O]. 
T OP 
"o E a| 
TOP 
[|e o ar 
TOP 
Empty Insert Insert 
Stack il F.F 


When an element is taken off from the stack, the operation is performed by pop(). 
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TOP 


Wo å W k 
WoO å W k 
Wo å W k 
Wo å W k 


Initial POP POP POP 
Starck 
Empty 
Stack 


Source code for stack operations, using array: 


STACK: Stack is a linear data structure which works under the principle of last in first out. Basic 
operations: push, pop, display. 


1. PUSH: if (top==MAX), display Stack overflow else reading the data and making stack [top] 
=data and incrementing the top value by doing top++. 


2. POP: if (top==0), display Stack underflow else printing the element at the top of the stack and 
decrementing the top value by doing the top. 


3. DISPLAY: IF (TOP==0), display Stack is empty else printing the elements in the stack from 
stack [0] to stack [top]. 


# Python program for implementation of stack 


# import maxsize from sys module 
# Used to return -infinite when stack is empty from sys import maxsize 


# Function to create a stack. It initializes size of stack as O 
def createStack(): 

stack = [] 

return stack 


# Stack is empty when stack size is 0 
def isEmpty(stack): 
return len(stack) == 0 


# Function to add an item to stack. It increases size by 1 
def push(stack, item): 

stack.append(item) 

print("pushed to stack " + item) 


# Function to remove an item from stack. It decreases size by 1 
def pop(stack): 
if (isEmpty(stack)): 
print("stack empty") 
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return str(-maxsize -1) #return minus infinite 
return stack.pop() 


# Driver program to test above functions 
stack = createStack() 

print("maximum size of array is",maxsize) 
push(stack, str(10)) 

push(stack, str(20)) 

push(stack, str(30)) 

print(pop(stack) + " popped from stack") 
print(pop(stack) + " popped from stack") 
print(pop(stack) + " popped from stack") 
print(pop(stack) + " popped from stack") 
push(stack, str(10)) 

push(stack, str(20)) 

push(stack, str(30)) 

print(pop(stack) + " popped from stack") 
Linked List Implementation of Stack: 


We can represent a stack as a linked list. In a stack push and pop operations are performed at one end 
called top. We can perform similar operations at one end of list using top pointer. 





Source code for stack operations, using linked list: 


# Python program for linked list implementation of stack 


# Class to represent a node 
class StackNode: 


# Constructor to initialize a node 
def init__(self, data): 
self.data = data 
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self. next = None 
class Stack: 


# Constructor to initialize the root of linked list 
def init__(self): 
self.root = None 


def isEmpty(self): 
return True if self.root is None else False 


def push(self, data): 
newNode = StackNode(data) 
newNode.next = self.root 
self.root = newNode 
print ("%d pushed to stack" %(data)) 


def pop(self): 
if (self.isEmpty()): 
return float("-inf") 
temp = self.root 
self. root = self.root.next 
popped = temp.data 
return popped 


def peek(self): 
if self.isEmpty(): 
return float('-inf") 
return self.root.data 


# Driver program to test above class 
stack = StackQ) 
stack.push(10) 
stack.push(20) 
stack.push(30) 


print ("%d popped from stack" %(stack.popQ)) 
print ("Top element is %d " %(stack.peek())) 
Stack Applications: 


1. Stack is used by compilers to check for balancing of parentheses, brackets and braces. 
2. Stack is used to evaluate a postfix expression. 


3. Stack is used to convert an infix expression into postfix/prefix form. 
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4. In recursion, all intermediate arguments and return values are stored on the processor’s stack. 


5. During a function call the return address and arguments are pushed onto a stack and on return 
they are popped off. 


6. Depth first search uses a stack data structure to find an element from a graph. 
In-fix- to Postfix Transformation: 
Procedure: 


Procedure to convert from infix expression to postfix expression is as follows: 


il Scan the infix expression from left to right. 
2. a) If the scanned symbol is left parenthesis, push it onto the stack. 

b) If the scanned symbol is an operand, then place directly in the postfix expression 
(output). 

C) If the symbol scanned is a right parenthesis, then go on popping all the items from 
the stack and place them in the postfix expression till we get the matching left 
parenthesis. 

d) If the scanned symbol is an operator, then go on removing all the operators from the 


stack and place them in the postfix expression, if and only if the precedence of the 
operator which is on the top of the stack is greater than (or equal) to the precedence 
of the scanned operator and push the scanned operator onto the stack otherwise, 
push the scanned operator onto the stack. 


Convert the following infix expression A + B * C—D/E * H into its equivalent postfix expression. 


AB 

AB 

ABC 

ABC * + 

ABC*+D 
/ ABC*+D 


E ABC*+DE 
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End of The input is now empty. Pop the output symbols from the 
string ABC*4DE/H®*- stack until it is empty. 





Source Code: 


# Python program to convert infix expression to postfix 


# Class to convert the expression 
import string 
class Conversion: 


# Constructor to initialize the class variables 
def __ init__(self, capacity): 

self.top = -1 

self.capacity = capacity 

# This array is used a stack 

self.array = [] 

# Precedence setting 

self.output = [] 

self precedence = {'+':1, '-":1, '*':2, 7:2, '%':3} 


# check if the stack 1s empty 
def isEmpty(self): 
return True if self.top == -1 else False 


# Return the value of the top of the stack 
def peek(self): 
return self.array|[-1] 


# Pop the element from the stack 
def pop(self): 
if not self.isEmpty(): 
self.top -= 1 
return self.array.pop() 
else: 
return "$" 


# Push the element to the stack 
def push(self, op): 
self.top += 1 
self.array.append(op) 


# A utility function to check is the given character 
# is operand 
def isOperand(self, ch): 

return ch.isalpha() 


# Check if the precedence of operator is strictly 
# less than top of stack or not 
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def notGreater(self, 1): 
try: 
a = self.precedence[1] 
b = self.precedence[self.peek() | 
return True if a <=b else False 
except KeyError: 
return False 


# The main function that converts given infix expression 
# to postfix expression 
def infixToPostfix(self, exp): 


# Iterate over the expression for conversion 
for 1 in exp: 
# If the character 1s an operand, 
# add it to output 
if self.isOperand(): 
self.output.append(1) 


# If the character is an '(', push it to stack 
elifi == '(': 
self.push() 


# If the scanned character is an ')', pop and 
# output from the stack until and '(' is found 
elif i =="): 
while( (not self.isEmpty()) and self.peek() != '(’): 
a = Self.popQ 
self.output.append(a) 
if (not self.isEmpty() and self.peekQ) != '(): 
return -1 
else: 


self popQ 


# An operator is encountered 
else: 
while(not self.isEmpty() and self.notGreater(1)): 
self.output.append(self.popQ) 
self.push() 


# pop all the operator from the stack 

while not self.isEmpty(): 
self.output.append(self.popQ) 

result= "" .join(self.output) 

print(result) 

# Driver program to test above function 

exp = "atb*(c4d-e)*(f+g*h)-1" 

obj = Conversion(len(exp)) 

obj.infixToPostfix(exp) 
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Evaluating Arithmetic Expressions: 
Procedure: 


The postfix expression is evaluated easily by the use of a stack. When a number is seen, it is pushed onto 
the stack; when an operator is seen, the operator is applied to the two numbers that are popped from the 
stack and the result is pushed onto the stack. 


Evaluate the postfix expression:6523+8*+3+* 


ee a T N 


6,5, 2,3 eNA first four symbols are 
placed on the stack. 
Next a ‘+’ is read, so 3 and 2 
+ 2 3 5 6,5,5 are popped from the stack and 
their sum 5, is pushed 


Now a ‘*’ is seen, so 8 and 5 
- 5 40 6, 5, 40 are popped as 8 * 5 = 40 is 
pushed 
Next, a ‘+’ is seen, so 40 and 
5 are popped and 40 + 5 = 45 
is pushed 
4 e ‘+? pops 3 and 45 and 
6, pushes 45 + 3 = 48 is pushed 
Finally, a ‘*’ is seen and 48 
i 48 288 288 and 6 are popped, the result 6 
* 48 = 288 is pushed 


Source Code: 





# Python program to evaluate value of a postfix expression 


# Class to convert the expression 
class Evaluate: 


# Constructor to initialize the class variables 
def __ init__(self, capacity): 
self.top = -1 
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self.capacity = capacity 
# This array is used a stack 
self.array = [] 


# check if the stack 1s empty 
def isEmpty(self): 
return True if self.top == -1 else False 


# Return the value of the top of the stack 
def peek(self): 


return self.array[-1] 


# Pop the element from the stack 


def pop(self): 
if not self.isEmpty(): 
self.top -= 1 
return self.array.pop() 
else: 
return "$" 


# Push the element to the stack 
def push(self, op): 
self.top += 1 
self.array.append(op) 


# The main function that converts given infix expression 
# to postfix expression 
def evaluatePostfix(self, exp): 


# Iterate over the expression for conversion 
for 1 in exp: 


# If the scanned character is an operand 
# (number here) push it to the stack 
if 1.isdigit(): 

self. push() 


# If the scanned character is an operator, 
# pop two elements from stack and apply it. 
else: 

vall = self.popQ 

val2 = self.pop(Q) 

self push(str(eval(val2 + 1 + vall1))) 


return int(self.popQ) 
# Driver program to test above function 
exp = "231*+9-" 


obj = Evaluate(len(exp)) 
print ("Value of {0} is {1}".format(exp, obj.evaluatePostfix(exp))) 
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Basic Queue Operations: 


A queue is a data structure that is best described as "first in, first out". A queue is another special kind of 
list, where items are inserted at one end called the rear and deleted at the other end called the front. A 
real world example of a queue is people waiting in line at the bank. As each person enters the bank, he 
or she is "enqueued" at the back of the line. When a teller becomes available, they are "dequeued" at the 
front of the line. 


Representation of a Queue using Array: 


Let us consider a queue, which can hold maximum of five elements. Initially the queue is empty. 


Queue Empty 
FRONT = REAR = 0 


F R 


Now, insert 11 to the queue. Then queue status will be: 


0 1 2 3 4 
11 REAR = REAR + 1= 1 
FRONT = 0 


Next, insert 22 to the queue. Then the queue status 1s: 


0 1 2 3 4 
REAR = REAR + 1 = 2 
“AY | FRONT = 0 


F R 


Again insert another element 33 to the queue. The status of the queue is: 
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0 1 2 3 4 
REAR = REAR + 1 = 3 
pu fez fos} | FRONT = 0 
F R 


Now, delete an element. The element deleted is the element at the front of the queue. So the status of the 
queue is: 


© i 2 3 4 
REAR =3 


F R 


Again, delete an element. The element to be deleted is always pointed to by the FRONT pointer. So, 22 
is deleted. The queue status is as follows: 


0 1 2 3 4 

33 REAR = 3 

FRONT = FRONT + 1 = 2 
F 

Now, insert new elements 44 and 55 into the queue. The queue status is: 


0 1 2 3 4 
REAR = 5 
|| 22 fae | ss FRONT = 2 


F R 


Next insert another element, say 66 to the queue. We cannot insert 66 to the queue as the rear crossed the 
maximum size of the queue (1.e., 5). There will be queue full signal. The queue status is as follows: 
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0 1 2 3 4 
REAR = 5 
|| fae fae | ss FRONT = 2 


F R 


Now it is not possible to insert an element 66 even though there are two vacant positions in the linear 
queue. To over come this problem the elements of the queue are to be shifted towards the beginning of 
the queue so that it creates vacant position at the rear end. Then the FRONT and REAR are to be 
adjusted properly. The element 66 can be inserted at the rear end. After this operation, the queue status 1s 
as follows: 


0 1 2 3 4 
REAR = 4 
ps3 [aa ]ss fos] FRONT 0 


F R 


This difficulty can overcome if we treat queue position with index O as a position that comes after 
position with index 4 1.e., we treat the queue as a circular queue. 


Procedure for Queue operations using array: 


In order to create a queue we require a one dimensional array Q(1:n) and two variables front and rear. 
The conventions we shall adopt for these two variables are that front is always 1 less than the actual 
front of the queue and rear always points to the last element in the queue. Thus, front = rear if and only if 
there are no elements in the queue. The initial condition then is front = rear = 0. 


The various queue operations to perform creation, deletion and display the elements in a queue are as 


follows: 
1. insertQ(): inserts an element at the end of queue Q. 
2. deleteQ(): deletes the first element of Q. 
3. displayQQ: displays the elements in the queue. 


Linked List Implementation of Queue: We can represent a queue as a linked list. In a queue data is 
deleted from the front end and inserted at the rear end. We can perform similar operations on the two 
ends of a list. We use two pointers front and rear for our linked queue implementation. 


fra nt rear 


10 |200}——» zo |300» 20 [soo | ao |X 
100 #00 +00 400 
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Source Code: 
front = 0 
rear = 0 
mymax = 3 
# Function to create a stack. It initializes size of stack as O 
def createQueue(): 
queue = |] 
return queue 
# Stack is empty when stack size is 0 
def isEmpty(queue): 
return len(queue) == 0 
# Function to add an item to stack. It increases size by 1 
def enqueue(queue,item): 
queue.append(item) 
# Function to remove an item from stack. It decreases size by 1 
def dequeue(queue): 
if (GsEmpty(queue)): 
return "Queue is empty" 
item=queue[0] 
del queue[0] 
return item 
# Driver program to test above functions 
queue = createQueue() 
while True: 
print(" 1 Enqueue") 
print(""2 Dequeue") 
print("3 Display") 


print("4 Quit") 
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ch=int(input("Enter choice")) 
if(ch==1): 
if(rear < mymax): 
item=input("enter item") 
enqueue(queue, item) 
rear = rear + | 
else: 
print("Queue is full") 
elif(ch==2): 
print(dequeue(queue)) 
elif(ch==3): 
print(queue) 
else: 
break 
Applications of Queues: 
1. Itis used to schedule the jobs to be processed by the CPU. 


2. When multiple users send print jobs to a printer, each printing job is kept in the printing queue. 
Then the printer prints those jobs according to first in first out (FIFO) basis. 


3. Breadth first search uses a queue data structure to find an element from a graph. 
Disadvantages of Linear Queue: 
There are two problems associated with linear queue. They are: 
e Time consuming: linear time to be spent in shifting the elements to the beginning of the queue. 
e Signaling queue full: even if the queue is having vacant position. 
Round Robin Algorithm: 


The round-robin (RR) scheduling algorithm is designed especially for time-sharing systems. It is similar 
to FCFS scheduling, but pre-emption is added to switch between processes. A small unit of time, called 

a time quantum or time slices, is defined. A time quantum is generally from 10 to 100 milliseconds. The 
ready queue is treated as a circular queue. To implement RR scheduling 


e We keep the ready queue as a FIFO queue of processes. 


e New processes are added to the tail of the ready queue. 
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e The CPU scheduler picks the first process from the ready queue, sets a timer to interrupt after 1 
time quantum, and dispatches the process. 


e The process may have a CPU burst of less than | time quantum. 
o In this case, the process itself will release the CPU voluntarily. 
o The scheduler will then proceed to the next process in the ready queue. 

e Otherwise, if the CPU burst of the currently running process is longer than 1 time quantum, 
o The timer will go off and will cause an interrupt to the OS. 


o A context switch will be executed, and the process will be put at the tail of the ready 
queue. 


o The CPU scheduler will then select the next process in the ready queue. 


Current Next Current 
process process process 


HGad DB HO G 


(a) (b) 


The average waiting time under the RR policy is often long. Consider the following set of processes that 
arrive at time O, with the length of the CPU burst given in milliseconds: (a time quantum of 4 
milliseconds) 


Burst | Waiting | Turnaround 


Process |Time | Time Time 
24 6 30 
3 4 7 
3 7 10 
Average | - 5.66 15.66 


Using round-robin scheduling, we would schedule these processes according to the following chart: 





—] 
Ja 
Taj 
= 

— 
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in 
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DEQUE(Double Ended Queue): 


A double-ended queue (dequeue, often abbreviated to deque, pronounced deck) generalizes a queue, 
for which elements can be added to or removed from either the front (head) or back (tail).It is also often 
called a head-tail linked list. Like an ordinary queue, a double-ended queue is a data structure it 
supports the following operations: eng_front, eng_back, deq_front, deq_back, and empty. Dequeue can 
be behave like a queue by using only enq_front and deq_front , and behaves like a stack by using only 
eng_front and deq_rear. 


The DeQueue is represented as follows. 


front rear 


deletion imgertion 
insertion deletion 


DEQUE can be represented in two ways they are 
1) Input restricted DEQUECURD) 
2) output restricted DEQUE(ORD) 


The output restricted DEQUE allows deletions from only one end and input restricted DEQUE allow 
insertions at only one end. The DEQUE can be constructed in two ways they are 


1) Using array 
2)Using linked list 
Operations in DEQUE 
1. Insert element at back 
2. Insert element at front 
3. Remove element at front 


4. Remove element at back 
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Insert_ front 


® insert_front() is a operation used to push an element into the 


front of the Deque. 
o > [2b [> [71 


FRONT REAR 





Insert_ back 


® insert_back() is a operation used to push an element at the 


back of a Deque. 
[os | |e ele eee 


/ / 


FRONT REAR 


ajep fe fs |7 
Remove _ front 


* remove_front() is a operation used to pop an element on front 


of the Deque. 
M paj [oe | x] 


FRONT REAR 
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Remove back 


e remove_front() is a operation used to pop an element on front 


of the Deque. 


i 
| 


\ | 
| 
\ | 


gaaat 
f 


FRONT REAR 





Applications of DEQUE: 


1. The A-Steal algorithm implements task scheduling for several processors (multiprocessor 
scheduling). 


2. The processor gets the first element from the deque. 


3. When one of the processor completes execution of its own threads it can steal a thread from 
another processor. 


4. It gets the last element from the deque of another processor and executes it. 
Circular Queue: 


Circular queue is a linear data structure. It follows FIFO principle. In circular queue the last node is 
connected back to the first node to make a circle. 


e Circular linked list fallow the First In First Out principle 
e Elements are added at the rear end and the elements are deleted at front end of the queue 
e Both the front and the rear pointers points to the beginning of the array. 
e Itis also called as “Ring buffer”. 
e Items can inserted and deleted from a queue in O(1) time. 
Circular Queue can be created in three ways they are 
1. Using single linked list 
2. Using double linked list 


3. Using arrays 
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Representation of Circular Queue: 


Let us consider a circular queue, which can hold maximum (MAX) of six elements. Initially the queue is 
empty. 


ih 


1 Queue Empty 
4 MAX = 
FRONT = REAR = O 
COUNT = O 


3 2 


Circular Queue 


Now, insert 11 to the circular queue. Then circular queue status will be: 


FRONT = 0 


REAR = (REAR + 1) % 6 = 1 
COUNT = 1 





Circular Queue 


Insert new elements 22, 33, 44 and 55 into the circular queue. The circular queue status is: 


FRONT = 0, REAR 
REAR = REAR 
COUNT 


= 5 
% 6 = 5 


= 5 





Circular Queue 


Now, delete an element. The element deleted is the element at the front of the circular queue. So, 11 is 
deleted. The circular queue status is as follows: 


52 


Download FREE Computer Science Notes at TutorialsDuniya.com 


Download FREE Computer Science Notes at TutorialsDuniya.com 


FRONT = (FRONT + 1) % 6 = 1 
REAR = 
COUNT = COUNT - 1 = 4 


UI 





Circular Queue 


Again, delete an element. The element to be deleted is always pointed to by the FRONT pointer. So, 22 
is deleted. The circular queue status 1s as follows: 


FRONT = (FRONT + 1) % 6 = 2 
REAR = 5 
COUNT = COUNT - 1 = 3 





Circular Queue 


Again, insert another element 66 to the circular queue. The status of the circular queue 1s: 


FRONT = 2 
REAR = (REAR + 1) % 6 = 0 
COUNT = COUNT + 1 = 4 





Circular Queue 


Now, insert new elements 77 and 88 into the circular queue. The circular queue status 1s: 
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FRONT = 2, REAR = 2 
REAR = REAR % 6 = 2 
COUNT = 6 





Circular Queue 


Now, if we insert an element to the circular queue, as COUNT = MAX we cannot add the element to 
circular queue. So, the circular queue 1s full. 
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UNIT-3 
LINKED LISTS 


Linked lists and arrays are similar since they both store collections of data. Array is the most common 
data structure used to store collections of elements. Arrays are convenient to declare and provide the easy 
syntax to access any element by its index number. Once the array is set up, access to any element is 
convenient and fast. 


The disadvantages of arrays are: 


e The size of the array is fixed. Most often this size is specified at compile time. This makes the 
programmers to allocate arrays, which seems "large enough" than required. 

e Inserting new elements at the front is potentially expensive because existing elements need to be 
shifted over to make room. 

e Deleting an element from an array is not possible. Linked lists have their own strengths and 
weaknesses, but they happen to be strong where arrays are weak. 

e Generally array's allocates the memory for all its elements in one block whereas linked lists use 
an entirely different strategy. Linked lists allocate memory for each element separately and only 
when necessary. 


Linked List Concepts: 


A linked list is a non-sequential collection of data items. It is a dynamic data structure. For every data 
item in a linked list, there is an associated pointer that would give the memory location of the next data 
item in the linked list. The data items in the linked list are not in consecutive memory locations. They 
may be anywhere, but the accessing of these data items is easier as each data item contains the address of 
the next data item. 


Advantages of linked lists: 
Linked lists have many advantages. Some of the very important advantages are: 


1. Linked lists are dynamic data structures. 1.e., they can grow or shrink during the execution of a 
program. 

2. Linked lists have efficient memory utilization. Here, memory is not preallocated. Memory is 
allocated whenever it is required and it is de-allocated (removed) when it is no longer needed. 

3. Insertion and Deletions are easier and efficient. Linked lists provide flexibility in inserting a data 
item at a specified position and deletion of the data item from the given position. 

4. Many complex applications can be easily carried out with linked lists. 


Disadvantages of linked lists: 


1. It consumes more space because every node requires a additional pointer to store address of the 


next node. 
2. Searching a particular element in list is difficult and also time consuming. 
Types of Linked Lists: 


Basically we can put linked lists into the following four items: 


1. Single Linked List. 
2. Double Linked List. 
3. Circular Linked List. 
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4. Circular Double Linked List. 


A single linked list is one in which all nodes are linked together in some sequential manner. Hence, it is 
also called as linear linked list. 


A double linked list is one in which all nodes are linked together by multiple links which helps in 
accessing both the successor node (next node) and predecessor node (previous node) from any arbitrary 
node within the list. Therefore each node in a double linked list has two link fields (pointers) to point to 
the left node (previous) and the right node (next). This helps to traverse in forward direction and 
backward direction. 


A circular linked list is one, which has no beginning and no end. A single linked list can be made a 
circular linked list by simply storing address of the very first node in the link field of the last node. 


A circular double linked list 1s one, which has both the successor pointer and predecessor pointer in the 
circular manner. 


Comparison between array and linked list: 


LINKED LIST 


Size of an array is fixed size of a list is not fixed 


Memory is allocated from stack Memory is allocated from heap 


It is necessary to specify the number of | It is not necessary to specify the 
elements during declaration (i.e., during | number of elements during declaration 
compile time). (i.e., memory is allocated during run 


time 


It occupies less memory than a linked | It occupies more memory. 
list for the same number of elements. 


Inserting new elements at the front is | Inserting a new element at any position 
potentially expensive because existing | can be carried out easily. 

elements need to be shifted over to 

make room. 

Deleting an element from an array is | Deleting an element is possible. 

not possible. 





Trade offs between linked lists and arrays: 


FEATURE ARRAYS LINKED LISTS 


Overhead per elements none 1 or 2 links 
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Applications of linked list: 


1. Linked lists are used to represent and manipulate polynomial. Polynomials are expression containing 


terms with non zero coefficient and exponents. For example: P(x) = a0 Xn + al Xn-I + ...... + an-1 
X +an 

2. Represent very large numbers and operations of the large number such as addition, multiplication and 
division. 


3. Linked lists are to implement stack, queue, trees and graphs. 4. Implement the symbol table in 
compiler construction. 


Single Linked List: 


A linked list allocates space for each element separately in its own block of memory called a "node". The 
list gets an overall structure by using pointers to connect all its nodes together like the links in a chain. 
Each node contains two fields; a "data" field to store whatever element, and a "next" field which is a 
pointer used to link to the next node. Each node is allocated in the heap using malloc(), so the node 
memory continues to exist until it is explicitly de-allocated using free(). The front of the list is a pointer to 
the “start” node. 


100 : “a 200 


The start 7 ; "i 
pointer holds = 

mri Each node stores Stores the next , la 
the address e ia. = ra The next field of the 
of the first last node is NULL. 


node of the 
list. 





A single linked list 


The beginning of the linked list is stored in a "start" pointer which points to the first node. The first node 
contains a pointer to the second node. The second node contains a pointer to the third node, ... and so on. 
The last node in the list has its next field set to NULL to mark the end of the list. Code can access any 
node in the list by starting at the start and following the next pointers. 


The start pointer is an ordinary local pointer variable, so it is drawn separately on the left top to show that 
it is in the stack. The list nodes are drawn on the right to show that they are allocated in the heap. 


The basic operations in a single linked list are: 
e Creation. 

e Insertion. 

e Deletion. 

e Traversing. 

Creating a node for Single Linked List: 


Creating a singly linked list starts with creating a node. Sufficient memory has to be allocated for creating 
a node. The information is stored in the memory. 
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120__| 300; —P_30_ | 400} ao |X 
“sy 200 300 400 |” 


100 : 
i 
The next field of the 


W k- 
‘Each node stores a Stores the next 
| last node is NULL. 


node address. 





Insertion of a Node: 


One of the most primitive operations that can be done in a singly linked list is the insertion of a node. 
Memory is to be allocated for the new node (in a similar way that is done while creating a list) before 
reading the data. The new node will contain empty data field and empty next field. The data field of the 
new node is then stored with the information read from the user. The next field of the new node is 
assigned to NULL. The new node can then be inserted at three different places namely: 


e Inserting a node at the beginning. 
e Inserting a node at the end. 


e Inserting a node at intermediate position. 


e Inserting a node at the beginning. 





e Inserting a node at the end: 


58 


Download FREE Computer Science Notes at TutorialsDuniya.com 


Download FREE Computer Science Notes at TutorialsDuniya.com 





e Inserting a node into the single linked list at a specified intermediate position other than 
beginning and end. 








Deletion of a node: 


Another primitive operation that can be done in a singly linked list is the deletion of a node. Memory is 


to be released for the node to be deleted. A node can be deleted from the list from three different places 
namely. 


e Deleting a node at the beginning. 
e Deleting a node at the end. 
e Deleting a node at intermediate position. 


Deleting a node at the beginning: 
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Deleting a node at the end: 





Deleting a node at Intermediate position: 


The following steps are followed, to delete a node from an intermediate position in the list (List must 
contain more than two node). 


sa Tao AD 





Traversal and displaying a list (Left to Right): 


To display the information, you have to traverse (move) a linked list, node by node from the first node, 
until the end of the list is reached. 


Traversing a list involves the following steps: 
e Assign the address of start pointer to a temp pointer. 


e Display the information from the data field of each node. The function traverse () is used for traversing 
and displaying the information stored in the list from left to right. 


Source Code for creating and inserting the Implementation of Single Linked List: 
class Item: 
"An Item that stores the data of a Linked List” 
def init__(self, data): 
"Initializes the data of the Item" 
self._ content = data # non-public data 


self.next = None 


class SingleLikedList: 
Single Linked List Implementation" 
def init__(self): 


self. start = None 
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self. count = 0 


def getItemAtIndex(self, pos): 
if self. start == None: 
return None 
i=0 
cursor = Self._ start 
while cursor != None: 
if 1 == pos: 
return cursor 
it= 1 


cursor = cursor.next 
return None 


def insert(self, data, pos = 0): 


item = Item(data) 


if self. start == None: 
self. start = item 
else: 
cursor = self.getItemAtIndex(pos) 
item.next = cursor.next 
item.next = cursor.next 


cursor.next = item 


self. count += 1 
def display(self): 
cursor = self. start 
while cursor != None: 
print(cursor._content, end=" ') 
cursor = cursor.next 
1 = SingleLikedList() 
l.insert(10) 
l.insert(20) 
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l.insert(30) 
l.insert(40) 
L.insert(50, 3) 
|.displayQ 


Source Code for creating , inserting ,deleting the Implementation of Single Linked List: 


class Node(object): 

def init__(self, data=None, next_node=None): 
self.data = data 
self.next_node = next_node 

def get_data(self): 
return self.data 

def get_next(self): 
return self.next_node 

def set_next(self, new_next): 


self.next_node = new_next 


class LinkedList(object): 
def init__(self, head=None): 
self head = head 
def insert(self, data): 
new_node = Node(data) 
new_node.set_next(self.head) 
self.head = new_node 
def size(self): 
current = self.head 
count = 0 
while current: 
count += | 
current = current.get_next() 
return count 
def search(self, data): 
current = self.head 


found = False 
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while current and found 1s False: 
if current.get_data() == data: 
found = True 
else: 
current = current.get_next() 
if current is None: 
raise ValueError("Data not in list") 
return current 
def delete(self, data): 
current = self.head 
previous = None 
found = False 
while current and found 1s False: 
if current.get_data() == data: 
found = True 
else: 
previous = current 
current = current.get_next() 
if current is None: 
raise ValueError("Data not in list") 
if previous is None: 
self. head = current.get_next() 
else: 


previous.set_next(current.get_next()) 


Source Code for creating , inserting ,deleting the Implementation of Single Linked List: 
import sys 
import os.path 
sys.path.append(os.path.join(os.path.abspath(os.pardir), '"/home/satya/PycharmProjects/DataStractures" )) 
from LinkedList2 import LinkedList 
import unittest 
class TestLinkedList(unittest.TestCase): 

def setUp(self): 

self. list = LinkedList() 
def tearDown(self): 
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self list = None 
def test_insert(self): 
self.list.insert( David") 
self.assertTrue(self.list.head.get_data() == "David") 
self.assertTrue(self.list.head.get_next() is None) 
def test_insert_two(self): 
self.list.insert(" David") 
self.list.insert("Thomas") 
self.assertTrue(self.list.head.get_data() == "Thomas") 
head_next = self.list.head.get_next() 
self.assertTrue(head_next.get_data() == "David") 
def test_nextNode(self): 
self.list.insert("Jacob") 
self. list.insert("Pallymay") 
self.list.insert("Rasmus") 
self.assertTrue(self.list.head.get_data() == "Rasmus") 
head_next = self.list.head.get_next() 
self.assertTrue(head_next.get_data() == "Pallymay") 
last = head_next.get_next() 
self.assertTrue(last.get_data() == "Jacob") 
def test_positive_search(self): 
self.list.insert("Jacob") 
self. list.insert("Pallymay") 
self.list.insert("Rasmus") 
found = self. list.search("Jacob") 
self.assertIrue(found.get_data() == "Jacob") 
found = self. list.search(""Pallymay") 
self.assertIrue(found.get_data() == "Pallymay") 
found = self. list.search(""Jacob") 
self.assertTrue(found.get_data() == "Jacob") 
def test_searchNone(self): 
self.list.insert("Jacob") 
self. list.insert("Pallymay") 
# make sure reg search works 


found = self. list.search("Jacob") 
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self.assertTrue(found.get_data() == "Jacob") 
with self.assertRaises( V alueError): 
self.list.search(" Vincent") 
def test_delete(self): 
self. list.insert("Jacob") 
self.list.insert("Pallymay”) 
self. list.insert("Rasmus") 
# Delete the list head 
self.list.delete("Rasmus") 
self.assertTrue(self.list.head.get_data() == "Pallymay") 
# Delete the list tail 
self.list.delete("Jacob") 
self.assertTrue(self.list.head.get_next() is None) 
def test_delete_value_not_in_list(self): 
self.list.insert("Jacob") 
self.list.insert("Pallymay”) 
self.list.insert("Rasmus") 
with self.assertRaises( V alueError): 
self.list.delete(" Sunny") 
def test_delete_empty_list(self): 
with self.assertRaises( V alueError): 
self.list.delete("Sunny") 
def test_delete_next_reassignment(self): 
self.list.insert("Jacob") 
self. list.insert("Cid") 
self.list.insert("Pallymay”) 
self. list.insert("Rasmus") 
self.list.delete("Pallymay") 
self. list.delete("Cid") 
self.assertTrue(self.list.head.next_node.get_data() == "Jacob") 


Source Code for creating , inserting ,deleting the Implementation of Single Linked List: 


class Node(object): 
def init__(self, data, next): 
self.data = data 
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self.next = next 
class SingleList(object): 
head = None 
tail = None 
def show(self): 
print "Showing list data:" 
current_ node = self. head 
while current_node is not None: 
print current_node.data, " -> ", 
current_ node = current_node.next 
print None 
def append(self, data): 
node = Node(data, None) 
if self.head is None: 
self.head = self.tail = node 
else: 
self.tail.next = node 
self.tail = node 
def remove(self, node_value): 
current_ node = self.head 
previous_node = None 
while current_node is not None: 
if current_node.data == node_ value: 
# if this is the first node (head) 
if previous_node is not None: 
previous_node.next = current_node.next 
else: 


self.head = current_node.next 


# needed for the next iteration 
previous_node = current_node 
current_ node = current_node.next 
s = SingleList() 
s.append(3 |) 
s.append(2) 
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s.append(3) 
s.append(4) 
s.show() 
s.remove(3 |) 
s.remove(3) 
s.remove(2) 


s.show() 


Using a header node: 


A header node is a special dummy node found at the front of the list. The use of header node is an 
alternative to remove the first node in a list. For example, the picture below shows how the list with data 
10, 20 and 30 would be represented using a linked list without and with a header node: 





Single Linked List without a header node 





Single Linked List with header node 


Note that if your linked lists do include a header node, there is no need for the special case code given 
above for the remove operation; node n can never be the first node in the list, so there is no need to check 
for that case. Similarly, having a header node can simplify the code that adds a node before a given node 
n. 


Note that if you do decide to use a header node, you must remember to initialize an empty list to contain 
one (dummy) node, you must remember not to include the header node in the count of "real" nodes in the 
list. 


It is also useful when information other than that found in each node of the list is needed. For example, 
imagine an application in which the number of items in a list is often calculated. In a standard linked list, 
the list function to count the number of nodes has to traverse the entire list every time. However, if the 
current length is maintained in a header node, that information can be obtained very quickly. 3.5. Array 
based linked lists: Another alternative is to allocate the nodes in blocks. In fact, if you know the 
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maximum size of a list a head of time, you can pre-allocate the nodes in a single array. The result is a 
hybrid structure — an array based linked list. 


shows an example of null terminated single linked list where all the nodes are allocated contiguously in 
an array. 


Conceptual structure 


Implementation 





Double Linked List: A double linked list is a two-way list in which all nodes will have two links. This 
helps in accessing both successor node and predecessor node from the given node position. It provides bi- 
directional traversing. Each node contains three fields: 


e = Left link. 
e Data. 
e Right link. 


The left link points to the predecessor node and the right link points to the successor node. The data field 
stores the required data. 


Many applications require searching forward and backward thru nodes of a list. For example searching 
for a name in a telephone directory would need forward and backward scanning thru a region of the whole 
list. 


The basic operations in a double linked list are: 


Creation. 
Insertion. 
Deletion. 
Traversing. 


A double linked list is shown in figure 
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Stores the previous 
node address. 


Pots 
* X| 10 [200 >i | 20 | 300200 | 30 | x 


The start 100 | 7 200 300 
pointer holds Y D £- 

the address Stores the data. Stores the next The right field of the 
of the first node address. last node is NULL. 
node of the 


list. 





The beginning of the double linked list is stored in a "start" pointer which points to the first node. The 
first node’s left link and last node’s right link 1s set to NULL. 


Creating a node for Double Linked List: 


Creating a double linked list starts with creating a node. Sufficient memory has to be allocated for 
creating a node. The information is stored in the memory. 


Double Linked List with 3 nodes: 


MESE = MENES 


200 300 





Inserting a node at the beginning: 


OEE 


200 300 





Inserting a node at the end: 
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Inserting a node at an intermediate position: 





Deleting a node at the beginning: 











[1o] 20 | x |g 200 1 30! x | 


200 300 





Deleting a node at Intermediate position: 
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Traversal and displaying a list (Left to Right): 


To display the information, you have to traverse the list, node by node from the first node, until the end 
of the list is reached. The function traverse_left_right() is used for traversing and displaying the 
information stored in the list from left to right. 


Traversal and displaying a list (Right to Left): 


To display the information from right to left, you have to traverse the list, node by node from the first 
node, until the end of the list is reached. The function traverse_right_left() is used for traversing and 
displaying the information stored in the list from right to left. 


1.Source Code for the Implementation of Double Linked List: 
class DList: 
def init__(self): 
self head = None 
self.size = O 
def insert(self, data): 
self.size += 1 
if self.head is None: 
self.head = Node(data) 
else: 
p = Node(data) 
p.next = self.head 
self.head.previous = p 
self.head = p 
def remove(self, index): 
if self.head is None: 
raise ValueError(‘Removing off an empty list’) 
if index < 0 or index >= self.size: 


raise IndexError("Index is either negative or greater than the list size." 
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current = self.head 
if index == 0: 
self head = self.head.next 
self. head.previous = None 
else: 
for _ in range(index -1): 
current = current.next 
p = current.next.next 
if p is None: 
current.next = None 
else: 
current.next = p 
p.previous = current 
def __sizeof _ (self): 
return self.size 
def _repr_ (self): 
res = '[' 
current = self.head 
while current is not None: 
res += str(current.data) 
res +='' 
current = current.next 
res += 'T' 
return res 
class Node: 
def init__(self, data): 
if data is None: 
raise ValueError('Node value cannot be None’) 
self.data = data 
self.previous = None 


self.next = None 


Source Code for the Implementation of Double Linked List: 
class Node(object): 


def __ init__(self, data, prev, next): 
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self.data = data 
self.prev = prev 


self.next = next 


class DoubleList(object): 
head = None 
tail = None 
def append(self, data): 
new_node = Node(data, None, None) 
if self.head is None: 
self.head = self.tail = new_node 
else: 
new_node.prev = self.tail 
new_node.next = None 
self.tail.next = new_node 
self.tail = new_node 
def remove(self, node_value): 


current_node = self.head 


while current_node is not None: 
if current_node.data == node_ value: 
# if it's not the first element 
if current_node.prev is not None: 
current_node.prev.next = current_node.next 


current_node.next.prev = current_node.prev 


else: 
# otherwise we have no prev (it's None), head is the next one, and prev becomes None 


self.head = current_node.next 


current_node.next.prev = None 


current_ node = current_node.next 


def show(self): 
print "Show list data:" 
current_node = self.head 
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while current_node is not None: 
print current_node.prev.data if hasattr(current_node.prev, "data") else None, 
print current_node.data, 


print current_node.next.data if hasattr(current_node.next, "data") else None 


current_ node = current_ node.next 
print "*"*50 

d = DoubleList() 

d.append(5) 

d.append(6) 

d.append(50) 

d.append(30) 

d.show() 

d.remove(50) 

d.remove(5) 


d.show() 


Circular Single Linked List: 


It is just a single linked list in which the link field of the last node points back to the address of the first 
node. A circular linked list has no beginning and no end. It is necessary to establish a special pointer 
called start pointer always pointing to the first node of the list. Circular linked lists are frequently used 
instead of ordinary linked list because many operations are much easier to implement. In circular linked 
list no null pointers are used, hence all pointers contain valid address. 


Creating a circular single Linked List with ‘n’ number of nodes: 





The basic operations in a circular single linked list are: 
e Creation. 

e Insertion. 

e Deletion. 


e Traversing. 
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Inserting a node at the beginning: 























Deleting a node at the beginning: 





Source Code for Circular Single Linked List: 


75 


Download FREE Computer Science Notes at TutorialsDuniya.com 


Download FREE Computer Science Notes at TutorialsDuniya.com 


from enum import Enum 
class NodeConstants(Enum): 
FRONT_NODE = 1 
class Node: 
def init__(self, element=None, next_node=None): 
self.element = element 
self.next_node = next_node 
def _str__(self): 
if self.element: 
return self.element._ str__() 
else: 
return ‘Empty Node' 
def __repr__(self): 
return self. str__(Q 
class CircularLinkedList: 
def init__(self): 
self.head = Node(element=NodeConstants. FRONT NODE) 
self.head.next_node = self.head 
def size(self): 
count = 0 
current = self.head.next_node 
while current != self.head: 
count += 1 
current = current.next_ node 
return count 
def insert_front(self, data): 
node = Node(element=data, next_node=self.head.next_node) 
self.head.next_ node = node 
def insert_last(self, data): 
current_node = self.head.next_node 
while current_node.next_ node != self.head: 
current_ node = current_node.next_node 
node = Node(element=data, next_node=current_node.next_node) 
current_node.next_node = node 


def insert(self, data, position): 
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if position == 0: 
self.insert_front(data) 
elif position == self.size(): 
self.insert_last(data) 
else: 
if O < position < self.size(): 
current_node = self.head.next_ node 
current_pos = 0 
while current_pos < position - 1: 
current_pos += 1 
current_ node = current_node.next_node 
node = Node(data, current_node.next_node) 
current_node.next_ node = node 
else: 
raise IndexError 
def remove_first(self): 
self.head.next_node = self.head.next_node.next_ node 
def remove_last(self): 
current_node = self.head.next_node 
while current_node.next_ node.next_node != self.head: 
current_ node = current_node.next_node 
current. node.next_node = self.head 
def remove(self, position): 
if position == 0: 
self.remove_first() 
elif position == self.size(): 
self.remove_last() 
else: 
if O < position < self.size(): 
current_node = self.head.next_ node 
current_pos = 0 
while current_pos < position - 1: 
current_ node = current_node.next_node 
current_pos += 1 


current_node.next_node = current_node.next_node.next_node 
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else: 
raise IndexError 
def fetch(self, position): 
if O <= position < self.sizeQ): 
current_ node = self.head.next_ node 
current_pos = 0 
while current_pos < position: 
current_ node = current_node.next_node 
current_pos += 1 
return current_ node.element 
else: 
raise IndexError 
import unittest 
from random import randint 
class TestCircularLinkedList(unittest.TestCase): 
names = ['Bob Belcher’, 
'Linda Belcher’, 
"Tina Belcher’, 
'Gene Belcher’, 
‘Louise Belcher'] 
def test_init(self): 
dll = CircularLinkedList() 
self.assertIsNotNone(dll.head) 
self.assertEqual(dll.sizeQ, 0) 
def test_insert_front(self): 
dll = CircularLinkedList() 
for name in TestCircularLinkedList.names: 
dll.insert_front(name) 
self.assertEqual(dll.fetch(O), TestCircularLinkedList.names[4]) 
self.assertEqual(dll.fetch(1), TestCircularLinkedList.names[3]) 
self.assertEqual(dll.fetch(2), TestCircularLinkedList.names[2]) 
self.assertEqual(dll.fetch(3), TestCircularLinkedList.names[1 ]) 
self.assertEqual(dll.fetch(4), TestCircularLinkedList.names[0]) 


def test_insert_last(self): 
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dll = CircularLinkedList() 
for name in TestCircularLinkedList.names: 
dll.insert_last(name) 
for 1 in range(len(TestCircularLinkedList.names) - 1): 
self.assertEqual(dll.fetchG), TestCircularLinkedList.names[1]) 
def test_insert(self): 
dll = CircularLinkedList() 
for name in TestCircularLinkedList.names: 
dll.insert_last(name) 
pos = randint(O, len(TestCircularLinkedList.names) - 1) 
dlil.insert('Teddy', pos) 
self.assertEqual(dll.fetch(pos), "Teddy') 
def test_remove_first(self): 
dll = CircularLinkedList() 
for name in TestCircularLinkedList.names: 
dll.insert_last(name) 
for 1 in range(dll.sizeQ), 0, -1): 
self.assertEqual(dll.size(), 1) 
dll.remove_first() 
def test_remove_last(self): 
dll = CircularLinkedList() 
for name in TestCircularLinkedList.names: 
dll.insert_last(name) 
for 1 in range(dll.sizeQ), 0, -1): 
self.assertEqual(dll.size(), 1) 
dll.remove_last() 
def test_remove(self): 
dll = CircularLinkedList() 
for name in TestCircularLinkedList.names: 
dll.insert_last(name) 
dil.remove(1) 
self.assertEqual(dll.fetch(O), 'Bob Belcher’) 
self.assertEqual(dll.fetch(1), "Tina Belcher’) 
self.assertEqual(dll.fetch(2), 'Gene Belcher’) 
self.assertEqual(dll.fetch(3), 'Louise Belcher’) 
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1 


if name =='_ main ': 


unittest.main() 


Circular Double Linked List: 


A circular double linked list has both successor pointer and predecessor pointer in circular manner. The 
objective behind considering circular double linked list is to simplify the insertion and deletion operations 
performed on double linked list. In circular double linked list the right link of the right most node points 
back to the start node and left link of the first node points to the last node. 


A circular double linked list is shown in figure 





Creating a Circular Double Linked List with ‘n’ number of nodes 
The basic operations in a circular double linked list are: 
e Creation. 
e Insertion. 
e Deletion. 


e Traversing. 


Inserting a node at the beginning: 


DESETE NERC 
300 


200 





Inserting a node at the end: 
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200 


100 300 





Inserting a node at an intermediate position: 











Deleting a node at the beginning: 





Deleting a node at the end: 
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Deleting a node at Intermediate position: 








Comparison of Linked List Variations: 


The major disadvantage of doubly linked lists (over singly linked lists) is that they require more space 
(every node has two pointer fields instead of one). Also, the code to manipulate doubly linked lists needs 
to maintain the prev fields as well as the next fields; the more fields that have to be maintained, the more 
chance there is for errors. 


The major advantage of doubly linked lists is that they make some operations (like the removal of a given 
node, or a right-to-left traversal of the list) more efficient. 


The major advantage of circular lists (over non-circular lists) is that they eliminate some extra-case code 
for some operations (like deleting last node). Also, some applications lead naturally to circular list 
representations. For example, a computer network might best be modeled using a circular list. 


Polynomials: 
A polynomial is of the form: ini) ci 
Where, ci is the coefficient of the ith term and 
nis the degree of the polynomial 

Some examples are: 

5x2 + 3x+1 

12x3 — 4x 
5x4 — 8x3 + 2x2 + 4x1 + 9x0 
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It is not necessary to write terms of the polynomials in decreasing order of degree. In other words the two 
polynomials 1 + x and x + 1 are equivalent. 


The computer implementation requires implementing polynomials as a list of pairs of coefficient and 
exponent. Each of these pairs will constitute a structure, so a polynomial will be represented as a list of 
structures. 


A linked list structure that represents polynomials 5x4 — 8x3 + 2x2 + 4x1 + 9x0 


Coefficient | Exponent 


K 





Addition of Polynomials: 


To add two polynomials we need to scan them once. If we find terms with the same exponent in the two 
polynomials, then we add the coefficients; otherwise, we copy the term of larger exponent into the sum 
and go on. When we reach at the end of one of the polynomial, then remaining part of the other is copied 
into the sum. 


To add two polynomials follow the following steps: 
e Read two polynomials 

e Add them. 

e Display the resultant polynomial. 
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UNIT —IV NON LINEAR DATA STRUCTURES 


Trees Basic Concepts: 


A tree is a non-empty set one element of which is designated the root of the tree while the remaining 
elements are partitioned into non-empty sets each of which is a sub-tree of the root. 


A tree T is a set of nodes storing elements such that the nodes have a parent-child relationship that 
satisfies the following 


e If T is not empty, T has a special tree called the root that has no parent. 


e Each node v of T different than the root has a unique parent node w; each node with parent w is a 
child of w. 


Tree nodes have many useful properties. The depth of a node is the length of the path (or the number of 
edges) from the root to that node. The height of a node is the longest path from that node to its leaves. 
The height of a tree is the height of the root. A leaf node has no children -- its only path is up to its 
parent. 

































































Binary Tree: 


In a binary tree, each node can have at most two children. A binary tree is either empty or consists of a 
node called the root together with two binary trees called the left subtree and the right subtree. 
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the ‘root node’ 





the ‘right node 
the ‘left' node 


‘sibling’ nodes 
A BINARY TREE 
Tree Terminology: 


Leaf node 


A node with no children is called a leaf (or external node). A node which is not a leaf is called an internal 
node. 


Path: A sequence of nodes ny, no, . . ., Nk, such that n; is the parent of n;,,; fori= 1, 2,...,k- 1. The 
length of a path is 1 less than the number of nodes on the path. Thus there is a path of length zero from a 
node to itself. 


Siblings: The children of the same parent are called siblings. 


Ancestor and Descendent If there is a path from node A to node B, then A is called an ancestor of B 
and B is called a descendent of A. 


Subtree: Any node of a tree, with all of its descendants is a subtree. 


Level: The level of the node refers to its distance from the root. The root of the tree has level O, and the 
level of any other node in the tree is one more than the level of its parent. 


The maximum number of nodes at any level is 2". 


Height:The maximum level in a tree determines its height. The height of a node in a tree is the length of a 
longest path from the node to a leaf. The term depth is also used to denote height of the tree. 


Depth:The depth of a node is the number of nodes along the path from the root to that node. 
Assigning level numbers and Numbering of nodes for a binary tree: The nodes of a binary tree can be 
numbered in a natural way, level by level, left to right. The nodes of a complete binary tree can be 


numbered so that the root is assigned the number 1, a left child is assigned twice the number assigned its 
parent, and a right child is assigned one more than twice the number assigned its parent. 
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Properties of Binary Trees: 


> nd Level 3 


Some of the important properties of a binary tree are as follows: 
1. Ifh=height of a binary tree, then 
a. Maximum number of leaves = 2" 
b. Maximum number of nodes = 2"*! - 1 
2. Ifa binary tree contains m nodes at level 1, it contains at most 2m nodes at level 1 + 1. 


3. Since a binary tree can contain at most one node at level O (the root), it can contain at most 2, 
node at level 1. 


4. The total number of edges in a full binary tree with n node is n - 1. 
Strictly Binary tree: 
If every non-leaf node in a binary tree has nonempty left and right subtrees, the tree is termed a strictly 


binary tree. Thus the tree of figure 7.2.3(a) is strictly binary. A strictly binary tree with n leaves always 
contains 2n - 1 nodes. 


a Sbk t Binary Tree 


{a} 
G 
fs) @| 


Full Binary Tree: 


A full binary tree of height h has all its leaves at level h. Alternatively; All non leaf nodes of a full binary 
tree have two children, and the leaf nodes have no children. 


A full binary tree with height h has 2°*' - 1 nodes. A full binary tree of height h is a strictly binary tree all 
of whose leaves are at level h. 
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Full binary tree {cd} 


For example, a full binary tree of height 3 contains 2**' — 1 = 15 nodes. 


Complete Binary Tree: 


A binary tree with n nodes is said to be complete if it contains all the first n nodes of the above 
numbering scheme. 


A complete binary tree of height h looks like a full binary tree down to level h-1, and the level h is filled 
from left to right. 





Complete binary tree {c} 
Perfect Binary Tree: 


A Binary tree is Perfect Binary Tree in which all internal nodes have two children and all leaves are at 
same level. 
Following are examples of Perfect Binary Trees. 


18 
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18 


15 30 


A Perfect Binary Tree of height h (where height is number of nodes on path from root to leaf) has 2" — 1 
node. 


Example of Perfect binary tree is ancestors in family. Keep a person at root, parents as children, parents 
of parents as their children. 


Balanced Binary Tree: 


A binary tree is balanced if height of the tree is O(Log n) where n is number of nodes. For Example, AVL 
tree maintain O(Log n) height by making sure that the difference between heights of left and right 
subtrees is 1. Red-Black trees maintain O(Log n) height by making sure that the number of Black nodes 
on every root to leaf paths are same and there are no adjacent red nodes. Balanced Binary Search trees are 
performance wise good as they provide O(log n) time for search, insert and delete. 


Representation of Binary Trees: 
1. Array Representation of Binary Tree 
2. Pointer-based. 

Array Representation of Binary Tree: 


A single array can be used to represent a binary tree. 
For these nodes are numbered / indexed according to a scheme giving O to root. Then all the 
nodes are numbered from left to right level by level from top to bottom. Empty nodes are also 


numbered. Then each node having an index 1 1s put into the array as its i™ element. 


In the figure shown below the nodes of binary tree are numbered according to the given scheme. 
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The figure shows how a binary tree is represented as an array. The root 3 is the 0” element while its 
leftchild 5 is the 1 st element of the array. Node 6 does not have any child so its children i.e. 7" and 8" 
element of the array are shown as a Null value. 


It is found that if n is the number or index of a node, then its left child occurs at (2n + 1)" position and 
right child at (2n + 2)" position of the array. If any node does not have any of its child, then null value is 
stored at the corresponding index of the array. 


The following program implements the above binary tree in an array form. And then traverses 
the tree in inorder traversal. 


# Python implementation to construct a Binary Tree from 
# parent array 


# A node structure 
class Node: 
# A utility function to create a new node 
def __ init__(self, key): 
self. key = key 
self. left = None 
self right = None 


""" Creates a node with key as '1'. If 1 is root,then 
it changes root. If parent of 1 is not created, then 
it creates parent first 


def createNode(parent, 1, created, root): 
# If this node is already created 
if created[i] is not None: 


return 


# Create a new node and set created[1] 
created[i] = Node(i) 


# If '1' is root, change root pointer and return 
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if parent[1] == -1: 
root[O] = created[1] # root[O] denotes root of the tree 
return 


# If parent is not created, then create parent first 
if created[parent[i]] is None: 
createNode(parent, parent[1], created, root ) 


# Find parent pointer 
p = created[parent[1]] 


# If this is first child of parent 
if p.left 1s None: 
p.left = created|[1] 
# If second child 
else: 
p.right = created|[1] 


# Creates tree from parent[0..n-1] and returns root of the 
# created tree 
def createTree(parent): 

n = len(parent) 


# Create and array created[] to keep track 
# of created nodes, initialize all entries as None 
created = [None for 1 in range(n+1)] 


root = [None] 
for 1 in range(n): 
createNode(parent, 1, created, root) 


return root[0] 


#Inorder traversal of tree 
def inorder(root): 
if root is not None: 
inorder(root.left) 
print root.key, 
inorder(root.right) 


# Driver Method 

parent = [-1, 0, 0, 1, 1, 3, 5] 

root = create Tree(parent) 

print "Inorder Traversal of constructed tree" 

inorder(root) 

Linked Representation of Binary Tree (Pointer based): 
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Binary trees can be represented by links where each node contains the address of the left child 
and the right child. If any node has its left or right child empty then it will have in its respective 
link field, a null value. A leaf node has null value in both of its links. 


# Python program to create a Complete Binary Tree from 
# its linked list representation 


# Linked List node 
class ListNode: 


# Constructor to create a new node 
def init__(self, data): 

self.data = data 

self.next = None 


# Binary Tree Node structure 
class BinaryTreeNode: 


# Constructor to create a new node 
def init__(self, data): 

self.data = data 

self. left = None 

self right = None 


# Class to convert the linked list to Binary Tree 
class Conversion: 


# Constructor for storing head of linked list 
# and root for the Binary Tree 
def _ init__(self, data = None): 
self.head = None 
self.root = None 
def push(self, new_data): 


# Creating a new linked list node and storing data 
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new_node = ListNode(new_ data) 


# Make next of new node as head 
new_node.next = self.head 


# Move the head to point to new node 
self.head = new_node 


def convertList2Binary(self): 


# Queue to store the parent nodes 
q=[] 


# Base Case 

if self.head is None: 
self.root = None 
return 


# 1.) The first node is always the root node, 
# and add it to the queue 

self.root = BinaryTreeNode(self.head.data) 
q.append(self.root) 


# Advance the pointer to the next node 
self.head = self.head.next 


# Until th end of linked list is reached, do: 
while(self.head): 


# 2.a) Take the parent node from the q and 
# and remove it from q 
parent = q.pop(0) # Front of queue 


# 2.c) Take next two nodes from the linked list. 
# We will add them as children of the current 
# parent node in step 2.b. 
# Push them into the queue so that they will be 
# parent to the future node 

leftChild= None 

rightChild = None 


leftChild = BinaryTreeNode(self.head.data) 
q.append(leftChild) 
self.head = self.head.next 
if(self.head): 
rightChild = BinaryTreeNode(self.head.data) 
q.append(rightChild) 
self.head = self.head.next 


#2.b) Assign the left and right children of parent 
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parent.left = leftChild 
parent.right = rightChild 


def inorderTraversal(self, root): 
if(root): 
self.inorderTraversal(root.left) 
print root.data, 
self.inorderTraversal(root.right) 


# Driver Program to test above function 


# Object of conversion class 
conv = Conversion() 
conv.push(36) 
conv.push(30) 
conv.push(25) 
conv.push(15) 
conv.push(12) 
conv.push(10) 


conv.convertList2Binary() 


print "Inorder Traversal of the contructed Binary Tree is:" 
conv.inorderTraversal(conv.root) 


Binary Tree Traversals: 


Traversal of a binary tree means to visit each node in the tree exactly once. The tree traversal is used in all t 
it. 





In a linear list nodes are visited from first to last, but a tree being a non linear one we need definite rules. Tt 
ways to traverse a tree. All of them differ only in the order in which they visit the nodes. 


The three main methods of traversing a tree are: 
=  I[norder Traversal 
= Preorder Traversal 


= Postorder Traversal 


In all of them we do not require to do anything to traverse an empty tree. All the traversal methods are bases 
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functions since a binary tree is itself recursive as every child of a node in a binary tree is itself a binary tree. 


Inorder Traversal: 

To traverse a non empty tree in inorder the following steps are followed recursively. 
= Visit the Root 
= Traverse the left subtree 


= ‘Traverse the right subtree 


The inorder traversal of the tree shown below 1s as follows. 


TN root 





inorder: 8, 15. 20, 30, 35, 43, 50, 60. 70, 82 


Preorder Traversal: 
Algorithm Pre-order(tree) 


1. Visit the root. 
2. Traverse the left sub-tree, 1.e., call Pre-order(left-sub-tree) 
3. Traverse the right sub-tree, 1.e., call Pre-order(right-sub-tree) 


Post-order Traversal: 


Algorithm Post-order(tree) 


1. Traverse the left sub-tree, 1.e., call Post-order(left-sub-tree) 
2. Traverse the right sub-tree, 1.e., call Post-order(right-sub-tree) 
3. Visit the root. 


# Python program for tree traversals 


# A class that represents an individual node in a 
# Binary Tree 
class Node: 
def init__(self, key): 
self.left = None 
self right = None 
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self.val = key 
# A function to do inorder tree traversal 
def printInorder(root): 
if root: 


# First recur on left child 
printInorder(root.left) 


# then print the data of node 
print(root.val), 


# now recur on right child 
printInorder(root.right) 


# A function to do postorder tree traversal 
def printPostorder(root): 


if root: 


# First recur on left child 
printPostorder(root.left) 


# the recur on right child 
printPostorder(root.right) 


# now print the data of node 
print(root.val), 
# A function to do postorder tree traversal 
def printPreorder(root): 


if root: 


# First print the data of node 
print(root.val), 


# Then recur on left child 
printPreorder(root.left) 


# Finally recur on right child 
printPreorder(root.right) 


# Driver code 
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root = Node(1) 

root.left = Node(2) 

root.right = Node(3) 

root.left.left = Node(4) 

root.left.right = Node(5) 

print "Preorder traversal of binary tree 1s" 
printPreorder(root) 


print "\nInorder traversal of binary tree is" 
printInorder(root) 


print "\nPostorder traversal of binary tree 1s" 
printPostorder(root) 


Time Complexity: O(n^2). 


Binary Search Tree: 


Binary Search Tree, is a node-based binary tree data structure which has the following properties: 


= The left sub-tree of a node contains only nodes with keys less than the node’s key. 
= The right sub-tree of a node contains only nodes with keys greater than the node’s key. 
= The left and right sub-tree each must also be a binary search tree. 

There must be no duplicate nodes. 





The above properties of Binary Search Tree provide an ordering among keys so that the operations like 
search, minimum and maximum can be done fast. If there is no ordering, then we may have to compare 
every key to search a given key. 


Searching a key 
To search a given key in Binary Search Tree, we first compare it with root, if the key is present at root, 
we return root. If key is greater than root’s key, we recur for right sub-tree of root node. Otherwise we 
recur for left sub-tree. 
# A utility function to search a given key in BST 
def search(root,key): 

# Base Cases: root is null or key is present at root 
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if root is None or root.val == key: 
return root 
# Key is greater than root's key 
if root.val < key: 
return search(root.right,key) 
# Key is smaller than root's key 
return search(root.left,key) 


Priority Queues 


Priority Queue is an extension of queue with following properties. 

1) Every item has a priority associated with it. 

2) An element with high priority is dequeued before an element with low priority. 

3) If two elements have the same priority, they are served according to their order in the queue. 


A typical priority queue supports following operations. 
insert(item, priority): Inserts an item with given priority. 
getHighestPriority(): Returns the highest priority item. 
deleteHighestPriority(): Removes the highest priority item. 


Implementation priority queue 
Using Array: A simple implementation is to use array of following structure. 
insert() operation can be implemented by adding an item at end of array in O(1) time. 


getHighestPriority() operation can be implemented by linearly searching the highest priority item in 
array. This operation takes O(n) time. 


deleteHighestPriority() operation can be implemented by first linearly searching an item, then removing 
the item by moving all subsequent items one position back. 


We can also use Linked List, time complexity of all operations with linked list remains same as array. The 
advantage with linked list is deleteHighestPriority() can be more efficient as we don’t have to move 
items. 

Applications of Priority Queue: 

1) CPU Scheduling 

2) Graph algorithms like Dykstra’s shortest path algorithm, Prim’s Minimum Spanning Tree, etc 

3) All queue applications where priority is involved. 


Application of Trees: 


1. One reason to use trees might be because you want to store information that naturally forms a 
hierarchy. For example, the file system on a computer: 


file system 


/ <-- root 
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2. If we organize keys in form of a tree (with some ordering e.g., BST), we can search for a 
given key in moderate time (quicker than Linked List and slower than arrays). Self-balancing 
search trees like AVL and Red-Black trees guarantee an upper bound of O(log,) for search. 


3) We can insert/delete keys in moderate time (quicker than Arrays and slower than Unordered 
Linked Lists). Self-balancing search trees like AVL and Red-Black trees guarantee an upper 
bound of O(log, ) for insertion/deletion. 


4) Like Linked Lists and unlike Arrays, Pointer implementation of trees don’t have an upper 
limit on number of nodes as nodes are linked using pointers. 


The following are the common uses of tree. 

1. Manipulate hierarchical data. 

2. Make information easy to search (see tree traversal). 

3. Manipulate sorted lists of data. 

4. As a workflow for compositing digital images for visual effects. 
5. Router algorithms 

Basic Graph Concepts: 


Graph is a data structure that consists of following two components: 

1. A finite set of vertices also called as nodes. 

2. A finite set of ordered pair of the form (u, v) called as edge. 

The pair is ordered because (u, v) is not same as (v, u) in case of directed graph (di-graph). The pair of 


form (u, v) indicates that there is an edge from vertex u to vertex v. The edges may contain 
weight/value/cost. 
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Graph and its representations: 


Graphs are used to represent many real life applications: Graphs are used to represent networks. The 
networks may include paths in a city or telephone network or circuit network. Graphs are also used in 
social networks like linkedIn, facebook. For example, in facebook, each person is represented with a 
vertex(or node). Each node is a structure and contains information like person id, name, gender and 
locale. 


Following is an example undirected graph with 5 vertices. 





Following two are the most commonly used representations of graph. 
1. Adjacency Matrix 
2. Adjacency List 


There are other representations also like, Incidence Matrix and Incidence List. The choice of the graph 
representation is situation specific. It totally depends on the type of operations to be performed and ease 
of use. 


Adjacency Matrix: 

Adjacency Matrix is a 2D array of size V x V where V is the number of vertices in a graph. Let the 2D 
array be adj[][], a slot adj[i][j] = 1 indicates that there is an edge from vertex 1 to vertex j. Adjacency 
matrix for undirected graph is always symmetric. Adjacency Matrix is also used to represent weighted 


graphs. If adj[i][j] = w, then there is an edge from vertex i to vertex j with weight w. 


The adjacency matrix for the above example graph is: 


0 I 2? 3 4 
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Adjacency Matrix Representation of the above graph 


Pros: Representation is easier to implement and follow. Removing an edge takes O(1) time. Queries like 
whether there is an edge from vertex ‘vw’ to vertex ‘v’ are efficient and can be done O(1). 


Cons: Consumes more space O(V%2). Even if the graph is sparse (contains less number of edges), it 
consumes the same space. Adding a vertex is O(V^2) time. 


Adjacency List: 


An array of linked lists is used. Size of the array is equal to number of vertices. Let the array be array[]. 
An entry array[i] represents the linked list of vertices adjacent to the ith vertex. This representation can 
also be used to represent a weighted graph. The weights of edges can be stored in nodes of linked lists. 
Following is adjacency list representation of the above graph. 


o| +0 4/4 

ı Ho 4] ee 
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Adjacency List Representation of the above Graph 
Breadth First Traversal for a Graph 
Breadth First Traversal (or Search) for a graph is similar to Breadth First Traversal of a tree The only 


catch here is, unlike trees, graphs may contain cycles, so we may come to the same node again. To avoid 
processing a node more than once, we use a boolean visited array. 


For example, in the following graph, we start traversal from vertex 2. When we come to vertex 0, we look 
for all adjacent vertices of it. 2 is also an adjacent vertex of 0. If we don’t mark visited vertices, then 2 
will be processed again and it will become a non-terminating process. Breadth First Traversal of the 
following graph is 2, 0, 3, 1. 
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Algorithm: Breadth-First Search Traversal 


BFS(V, E, s) 


for each u in V — {s} 
do color[u] — WHITE 
d[u] — infinity 
mu] — NIL 
color[s] — GRAY 


d[s] — 0 
nis] — NIL 
Q= 
ENQUEUE(Q, s) 
while Q is non-empty 
do u — DEQUEUE(Q) 
for each v adjacent to u 
do if color[v] — WHITE 
then color[v] — GRAY 
d[v] — dļu] + 1 
mv] — u 
ENQUEUE(Q, v) 


DEQUEUE(Q) 
color[u] — BLACK 
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Applications of Breadth First Traversal 


1) Shortest Path and Minimum Spanning Tree for unweighted graph In unweighted graph, the 
shortest path is the path with least number of edges. With Breadth First, we always reach a vertex from 
given source using minimum number of edges. Also, in case of unweighted graphs, any spanning tree is 
Minimum Spanning Tree and we can use either Depth or Breadth first traversal for finding a spanning 
tree. 


2) Peer to Peer Networks. In Peer to Peer Networks like BitTorrent, Breadth First Search is used to find 
all neighbor nodes. 


3) Crawlers in Search Engines: Crawlers build index using Bread First. The idea is to start from source 
page and follow all links from source and keep doing same. Depth First Traversal can also be used for 
crawlers, but the advantage with Breadth First Traversal is, depth or levels of built tree can be limited. 


4) Social Networking Websites: In social networks, we can find people within a given distance ‘k’ from 
a person using Breadth First Search till ‘k’ levels. 


5) GPS Navigation systems: Breadth First Search is used to find all neighboring locations. 


6) Broadcasting in Network: In networks, a broadcasted packet follows Breadth First Search to reach all 
nodes. 


7) In Garbage Collection: Breadth First Search is used in copying garbage collection using Cheney’s 
algorithm. 


8) Cycle detection in undirected graph: In undirected graphs, either Breadth First Search or Depth First 
Search can be used to detect cycle. In directed graph, only depth first search can be used. 


9) Ford—Fulkerson algorithm In Ford-Fulkerson algorithm, we can either use Breadth First or Depth 
First Traversal to find the maximum flow. Breadth First Traversal is preferred as it reduces worst case 
time complexity to O(VE’). 


10) To test if a graph is Bipartite We can either use Breadth First or Depth First Traversal. 


11) Path Finding We can either use Breadth First or Depth First Traversal to find if there is a path 
between two vertices. 


12) Finding all nodes within one connected component: We can either use Breadth First or Depth First 
Traversal to find all nodes reachable from a given node. 


Depth First Traversal for a Graph 


Depth First Traversal (or Search) for a graph is similar to Depth First Traversal of a tree. The only catch 
here is, unlike trees, graphs may contain cycles, so we may come to the same node again. To avoid 
processing a node more than once, we use a boolean visited array. 
For example, in the following graph, we start traversal from vertex 2. When we come to vertex 0, we look 
for all adjacent vertices of it. 2 is also an adjacent vertex of 0. If we don’t mark visited vertices, then 2 
will be processed again and it will become a non-terminating process. Depth First Traversal of the 
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following graph is 2, 0, 1, 3 


start (2) (3 ) 





Algorithm Depth-First Search 


The DFS forms a depth-first forest comprised of more than one depth-first trees. Each tree is made of 


edges (u, v) such that u is gray and v is white when edge (u, v) is explored. The following pseudocode for 
DFS uses a global timestamp time. 


DFS (V, E) 


for each vertex u in V[G] 
do color[u] — WHITE 
mu] — NIL 
time < 0 
for each vertex u in V[G] 
do if color[u] — WHITE 
then DFS-Visit(u) 


DFS-Visit(u) 


color[u] < GRAY 

time — time + 1 

d[u] < time 

for each vertex v adjacent to u 


do if color[v] — WHITE 
then z[v] — u 
DFS-Visit(v) 
color[u] — BLACK 
time < time + 1 
flu] < time 


Applications of Depth First Search 
Depth-first search (DFS) is an algorithm (or technique) for traversing a graph. 


Following are the problems that use DFS as a building block. 
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1) For an unweighted graph, DFS traversal of the graph produces the minimum spanning tree and all pair 
shortest path tree. 


2) Detecting cycle in a graph 
A graph has cycle if and only if we see a back edge during DFS. So we can run DFS for the graph and 
check for back edges. (See this for details) 


3) Path Finding 

We can specialize the DFS algorithm to find a path between two given vertices u and z. 
1) Call DFS(G, u) with u as the start vertex. 

11) Use a stack S to keep track of the path between the start vertex and the current vertex. 
111) As soon as destination vertex z 1s encountered, return the path as the 

contents of the stack 


4) Topological Sorting 


5) To test if a graph is bipartite 

We can augment either BFS or DFS when we first discover a new vertex, color it opposite its parents, and 
for each other edge, check it doesn’t link two vertices of the same color. The first vertex in any connected 
component can be red or black! See this for details. 


6) Finding Strongly Connected Components of a graph A directed graph is called strongly connected 
if there is a path from each vertex in the graph to every other vertex. (See this for DFS based also for 
finding Strongly Connected Components) 


C program to implement the Graph Traversal 
(a) Breadth first traversal 
(b) Depth first traversal 


DFS search starts from root node then traversal into left child node and continues, if item found it stops 
otherwise it continues. The advantage of DFS is it requires less memory compare to Breadth First Search 
(BFS). 


# Python program to print DFS traversal from a 
# given given graph 
from collections import defaultdict 


# This class represents a directed graph using 
# adjacency list representation 
class Graph: 


# Constructor 
def __init _ (self: 


# default dictionary to store graph 
self.graph = defaultdict(list) 


# function to add an edge to graph 
def addEdge(self,u,v): 
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self. graph[u].append(v) 


# A function used by DFS 
def DFS Util(self,v,visited): 


# Mark the current node as visited and print it 
visited[v]= True 
print (v), 


# Recur for all the vertices adjacent to this vertex 
for 1 in self.graph[v]: 
if visited[i] == False: 
self. DFSUtlG, visited) 


# The function to do DFS traversal. It uses 
# recursive DFSUtilQ 
def DFS(self,v): 


# Mark all the vertices as not visited 
visited = [False]*(len(self.graph)) 


# Call the recursive helper function to print 
# DFS traversal 
self. DFS Util(v, visited) 


# Driver code 

# Create a graph given in the above diagram 
g = Graph() 

g.addEdge(0, 1) 

g.addEdge(0, 2) 

g.addEdge(1, 2) 

g.addEdge(2, 0) 

g.addEdge(2, 3) 

g.addEdge(3, 3) 


print ("Following is DFS from (starting from vertex 2)") 
g.DFS(2) 


BFS search starts from root node then traversal into next level of graph or tree and continues, if item 
found it stops otherwise it continues. The disadvantage of BFS is it requires more memory compare to 
Depth First Search (DFS). 


# Program to print BFS traversal from a given source 
# vertex. BFS(int s) traverses vertices reachable 

# from s. 

from collections import defaultdict 
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# This class represents a directed graph using adjacency 
# list representation 
class Graph: 


# Constructor 
def _ init__(self): 


# default dictionary to store graph 
self. graph = defaultdict(list) 


# function to add an edge to graph 
def addEdge(self,u,v): 
self. graph[u].append(v) 


# Function to print a BFS of graph 
def BFS(self, s): 


# Mark all the vertices as not visited 
visited = [False]*(len(self.graph)) 


# Create a queue for BFS 
queue = [|] 


# Mark the source node as visited and enqueue it 
queue.append(s) 
visited[s] = True 


while queue: 


# Dequeue a vertex from queue and print it 
s = queue.pop(0) 
print (s) 


# Get all adjacent vertices of the dequeued 
# vertex s. If a adjacent has not been visited, 
# then mark it visited and enqueue it 
for 1 in self.graph[s]: 
if visited[i] == False: 
queue.append(1) 
visited[i] = True 


# Driver code 
# Create a graph given in the above diagram 
g = Graph() 
g.addEdge(0, 1) 
g.addEdge(0, 2) 
g.addEdge(1, 2) 
g.addEdge(2, 0) 
g.addEdge(2, 3) 
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g.addEdge(3, 3) 


print ("Following is Breadth First Traversal (starting from vertex 2)") 
g.BFS(2) 
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UNIT —- V BINARY TREES AND HASHING 


Binary Search Trees: 


An important special kind of binary tree is the binary search tree (BST). In a BST, each node 
stores some information including a unique key value, and perhaps some associated data. A 
binary tree is a BST iff, for every node n in the tree: 


e All keys in n's left subtree are less than the key in n, and 
e All keys in n's right subtree are greater than the key in n. 


In other words, binary search trees are binary trees in which all values in the node’s left subtree are less 
than node value all values in the node’s right subtree are greater than node value. 





Here are some BSTs in which each node just stores an integer key: 

the 
ow ARA 
ts U PJ) BJF BJ P 
These are not BSTs: 

a 
O ZNS A A 
is a J EJ EJ fe 
In the left one 5 is not greater than 6. In the right one 6 is not greater than 7. 


The reason binary-search trees are important is that the following operations can be implemented 
efficiently using a BST: 


e insert a key value 
e determine whether a key value is in the tree 
e remove a key value from the tree 


e print all of the key values in sorted order 


Let's illustrate what happens using the following BST: 
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and searching for 12: 


l2 « 13 so goto 


left subtree 





l2 > Ẹł so goto 


cight subtree. 





found! 





What if we search for 15: 
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15> 13 so goto 


cight subtree 


15 « 16 so goto 
left subtree. It 
does not exist so 


search tails and it 





cretucns false 


Properties and Operations: 


A BST is a binary tree of nodes ordered in the following way: 


Each node contains one key (also unique) 

The keys in the left subtree are < (less) than the key in its parent node 
The keys in the right subtree > (greater) than the key in its parent node 
Duplicate node keys are not allowed. 


SS 


Inserting a node 


A naive algorithm for inserting a node into a BST is that, we start from the root node, if the node to insert 
is less than the root, we go to left child, and otherwise we go to the right child of the root. We continue 
this process (each node is a root for some sub tree) until we find a null pointer (or leaf node) where we 
cannot go any further. We then insert the node as a left or right child of the leaf node based on node 1s less 
or greater than the leaf node. We note that a new node is always inserted as a leaf node. A recursive 
algorithm for inserting a node into a BST is as follows. Assume we insert a node N to tree T. if the tree is 
empty, the we return new node N as the tree. Otherwise, the problem of inserting is reduced to inserting 
the node N to left of right sub trees of T, depending on N is less or greater than T. A definition is as 
follows. 


Insert(N, T) =N if T is empty 
= insert(N, T.left) if N < T 
= insert(N, T.right) if N > T 


Searching for a node 
Searching for a node is similar to inserting a node. We start from root, and then go left or right until we 
find (or not find the node). A recursive definition of search is as follows. If the node is equal to root, then 
we return true. If the root is null, then we return false. Otherwise we recursively solve the problem for 
T left or T.right, depending on N < T or N > T. A recursive definition is as follows. 
Search should return a true or false, depending on the node is found or not. 
Search(N, T) = false if T 1s empty 

= true ifT=N 
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= search(N, T.left) if N < T 
= search(N, T.right) if N > T 
Deleting a node 


A BST is a connected structure. That is, all nodes in a tree are connected to some other node. For 
example, each node has a parent, unless node is the root. Therefore deleting a node could affect all sub 


trees of that node. For example, deleting node 5 from the tree could result in losing sub trees that are 
rooted at 1 and 9. 


: 15 46 


Hence we need to be careful about deleting nodes from a tree. The best way to deal with deletion seems to 
be considering special cases. What if the node to delete is a leaf node? What if the node is a node with 
just one child? What if the node is an internal node (with two children). The latter case is the hardest to 
resolve. But we will find a way to handle this situation as well. 

Case 1 : The node to delete is a leaf node 

This is a very easy case. Just delete the node 46. We are done 


LD 


3 


Case 2 : The node to delete is a node with one child. 

This is also not too bad. If the node to be deleted is a left child of the parent, then we connect the left 
pointer of the parent (of the deleted node) to the single child. Otherwise if the node to be deleted is a right 
child of the parent, then we connect the right pointer of the parent (of the deleted node) to single child. 
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Case 3: The node to delete is a node with two children 

This is a difficult case as we need to deal with two sub trees. But we find an easy way to handle it. First 
we find a replacement node (from leaf node or nodes with one child) for the node to be deleted. We need 
to do this while maintaining the BST order property. Then we swap leaf node or node with one child with 
the node to be deleted (swap the data) and delete the leaf node or node with one child (case 1 or case 2) 


Next problem is finding a replacement leaf node for the node to be deleted. We can easily find this as 
follows. If the node to be deleted is N, the find the largest node in the left sub tree of N or the smallest 
node in the right sub tree of N. These are two candidates that can replace the node to be deleted without 


losing the order property. For example, consider the following tree and suppose we need to delete the root 
38. 


38 
/ \ 
S 45 
KON \ 
i 9 47 
' A. ` / 
= 15 46 
/ 


Then we find the largest node in the left sub tree (15) or smallest node in the right sub tree (45) and 
replace the root with that node and then delete that node. The following set of images demonstrates this 
process. 


13 L3 13 


Let’s see when we delete 13 from that tree. 
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Criginal BST with 
13 located 





step inte right 


subtree. [5 | | 13 | 


(toa to lett child. 


> 
Continue to left 


child. This is last [5] | 13 | 


GOLE. 


T 

> 
Eeplace node to 
delete with far left [19 | 


child of ngeht subtree. 





Remove far left child 


of right subtree. | 5 | | 15 | 
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Balanced Search Trees: 


A self-balancing (or height-balanced) binary search tree is any node-based binary search tree 
that automatically keeps its height (maximal number of levels below the root) small in the face of 
arbitrary item insertions and deletions. The red—black tree, which is a type of self-balancing 
binary search tree, was called symmetric binary B-tree. Self-balancing binary search trees can be 
used in a natural way to construct and maintain ordered lists, such as priority queues. They can 
also be used for associative arrays; key-value pairs are simply inserted with an ordering based on 
the key alone. In this capacity, self-balancing BSTs have a number of advantages and 
disadvantages over their main competitor, hash tables. One advantage of self-balancing BSTs 1s 
that they allow fast (indeed, asymptotically optimal) enumeration of the items in key order, 
which hash tables do not provide. One disadvantage is that their lookup algorithms get more 
complicated when there may be multiple items with the same key. Self-balancing BSTs have 
better worst-case lookup performance than hash tables (O(log n) compared to O(n)), but have 
worse average-case performance (O(log n) compared to O(1)). 


Self-balancing BSTs can be used to implement any algorithm that requires mutable ordered lists, 
to achieve optimal worst-case asymptotic performance. For example, if binary tree sort is 
implemented with a self-balanced BST, we have a very simple-to-describe yet asymptotically 
optimal O(n log n) sorting algorithm. Similarly, many algorithms in computational geometry 
exploit variations on self-balancing BSTs to solve problems such as the line segment intersection 
problem and the point location problem efficiently. (For average-case performance, however, 
self-balanced BSTs may be less efficient than other solutions. Binary tree sort, in particular, is 
likely to be slower than merge sort, quicksort, or heapsort, because of the tree-balancing 
overhead as well as cache access patterns.) 


Self-balancing BSTs are flexible data structures, in that it's easy to extend them to efficiently 
record additional information or perform new operations. For example, one can record the 
number of nodes in each subtree having a certain property, allowing one to count the number of 
nodes in a certain key range with that property in O(log n) time. These extensions can be used, 
for example, to optimize database queries or other list-processing algorithms. 

AVL Trees: 


An AVL tree is another balanced binary search tree. Named after their inventors, Adelson- 
Velskii and Landis, they were the first dynamically balanced trees to be proposed. Like red-black 
trees, they are not perfectly balanced, but pairs of sub-trees differ in height by at most 1, 
maintaining an O(logn) search time. Addition and deletion operations also take O(logn) time. 


Definition of an AVL tree: An AVL tree is a binary search tree which has the following 
properties: 


1. The sub-trees of every node differ in height by at most one. 
2. Every sub-tree is an AVL tree. 
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Balance 
requirement 
for an AVL t 
tree: the left 


and right h h-} 


sub-trees 
differ by at 
most 1 in 


height. 





For example, here are some trees: 





Yes this is an AVL tree. Examination shows that each left sub-tree has a height 1 greater than 
each right sub-tree. 





No this is not an AVL tree. Sub-tree with root 8 has height 4 and sub-tree with root 18 has 
height 2. 

An AVL tree implements the Map abstract data type just like a regular binary search tree, the 
only difference is in how the tree performs. To implement our AVL tree we need to keep track 
of a balance factor for each node in the tree. We do this by looking at the heights of the left and 
right subtrees for each node. More formally, we define the balance factor for a node as the 
difference between the height of the left subtree and the height of the right subtree. 

balanceFactor=height(leftSubTree)—height(rightSubTree) 
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Using the definition for balance factor given above we say that a subtree is left-heavy if the 
balance factor is greater than zero. If the balance factor is less than zero then the subtree is right 
heavy. If the balance factor is zero then the tree is perfectly in balance. For purposes of 
implementing an AVL tree, and gaining the benefit of having a balanced tree we will define a 
tree to be in balance if the balance factor is -1, 0, or 1. Once the balance factor of a node in a 
tree is outside this range we will need to have a procedure to bring the tree back into balance. 
Figure shows an example of an unbalanced, right-heavy tree and the balance factors of each 
node. 





Properties of AVL Trees 


AVL trees are identical to standard binary search trees except that for every node in an AVL 
tree, the height of the left and right subtrees can differ by at most 1 (Weiss, 1993, p:108). AVL 
trees are HB-k trees (height balanced trees of order k) of order HB-1. 


The following is the height differential formula: 


|Height(Tz,) — Height(T'R)| < k 


When storing an AVL tree, a field must be added to each node with one of three values: 1, 0, or 
-1. A value of 1 in this field means that the left subtree has a height one more than the right 
subtree. A value of -1 denotes the opposite. A value of O indicates that the heights of both 
subtrees are the same. Updates of AVL trees require up to *'-=7*3rotations, whereas updating 
red-black trees can be done using only one or two rotations (up to 7“'° color changes). For 
this reason, they (AVL trees) are considered a bit obsolete by some. 


Sparse AVL trees 
Sparse AVL trees are defined as AVL trees of height h with the fewest possible nodes. Figure 3 


shows sparse AVL trees of heights 0, 1, 2, and 3. 
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height=] 





heicht=2 





height=3 
Figure Structure of an AVL tree 


Introduction to M-Way Search Trees: 


A multiway tree is a tree that can have more than two children. A multiway tree of order m (or 
an m-way tree) is one in which a tree can have m children. 


As with the other trees that have been studied, the nodes in an m-way tree will be made up of key 
fields, in this case m-1 key fields, and pointers to children. 


Multiday tree of order 5 


ODOUR 
be} E Ha 
ce] [tte] ANd Ley dede 


To make the processing of m-way trees easier some type of order will be imposed on the keys 
within each node, resulting in a multiway search tree of order m (or an m-way search tree). 
By definition an m-way search tree is a m-way tree in which: 


e Each node has m children and m-1 key fields 
e The keys in each node are in ascending order. 
e The keys in the first 1 children are smaller than the ith key 
e The keys in the last m-i children are larger than the ith key 


4-way search tree 
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M-way search trees give the same advantages to m-way trees that binary search trees gave to 
binary trees - they provide fast information retrieval and update. However, they also have the 
same problems that binary search trees had - they can become unbalanced, which means that the 
construction of the tree becomes of vital importance. 

B Trees: 


An extension of a multiway search tree of order m is a B-tree of order m. This type of tree will 
be used when the data to be accessed/stored is located on secondary storage devices because they 
allow for large amounts of data to be stored in a node. 


A B-tree of order m is a multiway search tree in which: 


1. The root has at least two subtrees unless it is the only node in the tree. 

2. Each nonroot and each nonleaf node have at most m nonempty children and at least m/2 
nonempty children. 

3. The number of keys in each nonroot and each nonleaf node is one less than the number of 
its nonempty children. 

4. All leaves are on the same level. 


These restrictions make B-trees always at least half full, have few levels, and remain perfectly 
balanced. 


Searching a B-tree 

An algorithm for finding a key in B-tree is simple. Start at the root and determine which pointer 
to follow based on a comparison between the search value and key fields in the root node. 
Follow the appropriate pointer to a child node. Examine the key fields in the child node and 
continue to follow the appropriate pointers until the search value is found or a leaf node is 


reached that doesn't contain the desired search value. 


Insertion into a B-tree 
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The condition that all leaves must be on the same level forces a characteristic behavior of B- 
trees, namely that B-trees are not allowed to grow at the their leaves; instead they are forced to 
grow at the root. 


When inserting into a B-tree, a value is inserted directly into a leaf. This leads to three common 
situations that can occur: 


1. A key is placed into a leaf that still has room. 
2. The leaf in which a key is to be placed is full. 
3. The root of the B-tree is full. 


Case 1: A Key is placed into a leaf that still has room 


This is the easiest of the cases to solve because the value is simply inserted into the correct sorted 
position in the leaf node. 





Case 2: The leaf in which a key is to be placed is full 

In this case, the leaf node where the value should be inserted is split in two, resulting in a new 
leaf node. Half of the keys will be moved from the full leaf to the new leaf. The new leaf is then 
incorporated into the B-tree. 

The new leaf is incorporated by moving the middle value to the parent and a pointer to the new 
leaf is also added to the parent. This process is continues up the tree until all of the values have 


"found" a location. 


Insert 6 into the following B-tree: 
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The new node needs to be incorporated into the tree - this is accomplished by taking the middle 
value and inserting it in the parent: 





Case 3: The root of the B-tree is full 


The upward movement of values from case 2 means that it's possible that a value could move up 
to the root of the B-tree. If the root is full, the same basic process from case 2 will be applied and 
a new root will be created. This type of split results in 2 new nodes being added to the B-tree. 


Inserting 13 into the following tree: 
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Results in: 


ms 
A A a Sa 


The 15 needs to be moved to the root node but it is full. This means that the root needs to be 


divided: 








—— 


18 | 19 21) 23) 5) 8) AIRAS 














Deleting from a B-tree 


As usual, this is the hardest of the processes to apply. The deletion process will basically be a 
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reversal of the insertion process - rather than splitting nodes, it's possible that nodes will be 
merged so that B-tree properties, namely the requirement that a node must be at least half full, 
can be maintained. 


There are two main cases to be considered: 


1. Deletion from a leaf 
2. Deletion from a non-leaf 


Case 1: Deletion from a leaf 


la) If the leaf is at least half full after deleting the desired value, the remaining larger values are 
moved to "fill the gap". 


Deleting 6 from the following tree: 


pull 
H Pal 
og SO EO 


results in: 


JENN 
uu a 
EED OPED G BLD L 


1b) If the leaf is less than half full after deleting the desired value (known as underflow), two 
things could happen: 


Deleting 7 from the tree above results in: 
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1b-1) If there is a left or right sibling with the number of keys exceeding the minimum 
requirement, all of the keys from the leaf and sibling will be redistributed between them by 
moving the separator key from the parent to the leaf and moving the middle key from the node 


and the sibling combined to the parent. 
DCER EONA 
eU BL HL eA ee peL 


Now delete 8 from the tree: 


ed 
BCA iat | 
tT tT PE eee eed e ele 


1b-2) If the number of keys in the sibling does not exceed the minimum requirement, then the 
leaf and sibling are merged by putting the keys from the leaf, the sibling, and the separator from 
the parent into the leaf. The sibling node is discarded and the keys in the parent are moved to 
"fill the gap". It's possible that this will cause the parent to underflow. If that is the case, treat the 
parent as a leaf and continue repeating step 1b-2 until the minimum requirement is met or the 


root of the tree is reached. 


Special Case for 1b-2: When merging nodes, if the parent is the root with only one key, the keys 
from the node, the sibling, and the only key of the root are placed into a node and this will 
become the new root for the B-tree. Both the sibling and the old root will be discarded. 
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one 


16 | 22 
TT GER GT BLATT 


Case 2: Deletion from a non-leaf 





This case can lead to problems with tree reorganization but it will be solved in a manner similar 
to deletion from a binary search tree. 


The key to be deleted will be replaced by its immediate predecessor (or successor) and then the 
predecessor (or successor) will be deleted since it can only be found in a leaf node. 


TLE EA GBT] A ELL 


The "gap" is filled in with the immediate predecessor: 


Deleting 16 from the tree above results in: 
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co 


and then the immediate predecessor is deleted: 














The vales in the left sibling are combined with the separator key (18) and the remaining values. 
They are divided between the 2 nodes: 
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and then the middle value is moved to the parent: 








Hashing and Collision: 


Hashing is the technique used for performing almost constant time search in case of insertion, deletion 
and find operation. Taking a very simple example of it, an array with its index as key is the example of 
hash table. So each index (key) can be used for accessing the value in a constant search time. This 
mapping key must be simple to compute and must helping in identifying the associated value. Function 
which helps us in generating such kind of key-value mapping is known as Hash Function. 


In a hashing system the keys are stored in an array which is called the Hash Table. A perfectly 
implemented hash table would always promise an average insert/delete/retrieval time of O(1). 


Hashing Function: 


A function which employs some algorithm to computes the key K for all the data elements in the set U, 
such that the key K which is of a fixed size. The same key K can be used to map data to a hash table 
and all the operations like insertion, deletion and searching should be possible. The values returned by 
a hash function are also referred to as hash values, hash codes, hash sums, or hashes. 


hash 
keys function hashes 
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Hash Collision: 


A situation when the resultant hashes for two or more data elements in the data set U, maps to the same 
location in the has table, is called a hash collision. In such a situation two or more data elements would 
qualify to be stored / mapped to the same location in the hash table. 


Hash collision resolution techniques: 
Open Hashing (Separate chaining): 


Open Hashing, is a technique in which the data is not directly stored at the hash key index (k) of the Hash 
table. Rather the data at the key index (k) in the hash table is a pointer to the head of the data structure 
where the data is actually stored. In the most simple and common implementations the data structure 
adopted for storing the element is a linked-list. 


keys buckets entries 
pend [=| Usasmm | 52-8976 | 
|=| Usasmm | 52-3976 | 
aR a 
A \ f a= ETIN 
ill -O xp nise | asies | 


- aE 
n this technique when a data needs to be searched, it might become necessary (worst case) to traverse 


all the nodes in the linked list to retrieve the data. 


Note that the order in which the data is stored in each of these linked lists (or other data structures) is 
completely based on implementation requirements. Some of the popular criteria are insertion order, 
frequency of access etc. 


Closed hashing (open Addressing) 


In this technique a hash table with pre-identified size is considered. All items are stored in the hash 
table itself. In addition to the data, each hash bucket also maintains the three states: EMPTY, 
OCCUPIED, DELETED. While inserting, if a collision occurs, alternative cells are tried until an empty 
bucket is found. For which one of the following technique is adopted. 
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1. Liner Probing 


Quadratic probing 


3. Double hashing (in short in case of collision another hashing function is used with the key value 
as an input to identify where in the open addressing scheme the data should actually be stored.) 


A comparative analysis of Closed Hashing vs Open Hashing 


Open Addressing Closed Addressing 


All elements would be 
stored in the Hash table 
itself. No additional data 
structure is needed. 


In cases of collisions, a 


Determining size of the 
hash table, adequate enough 


State needs be maintained 
for the data (additional 
work) 


Additional Data structure 
needs to be used to 
accommodate collision 
data. 


Simple and effective 
approach to collision 
resolution. Key may or may 
not be unique. 


Performance deterioration 
of closed addressing much 
slower as compared to 
Open addressing. 


No state data needs to be 
maintained (easier to 
maintain) 


Uses space efficiently 


Applications of Hashing: 





A hash function maps a variable length input string to fixed length output string -- its hash value, or 


hash for short. If the input is longer than the output, 


then some inputs must map to the same output -- a 


hash collision. Comparing the hash values for two inputs can give us one of two answers: the inputs are 
definitely not the same, or there is a possibility that they are the same. Hashing as we know it is used for 
performance improvement, error checking, and authentication. One example of a performance 
improvement is the common hash table, which uses a hash function to index into the correct bucket in 
the hash table, followed by comparing each element in the bucket to find a match. In error checking, 
hashes (checksums, message digests, etc.) are used to detect errors caused by either hardware or 
software. Examples are TCP checksums, ECC memory, and MD5 checksums on downloaded files’ In 
this case, the hash provides additional assurance that the data we received is correct. Finally, hashes are 
used to authenticate messages. In this case, we are trying to protect the original input from tampering, 
and we select a hash that is strong enough to make malicious attack infeasible or unprofitable. 


e Construct a message authentication code (MAC) 


e Digital signature 


e Make commitments, but reveal message later 


e Timestamping 


e Key updating: key is hashed at specific intervals resulting in new key 
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